CONTINUITY, CURVATURE, AND THE GENERAL 
COVARIANCE OF OPTIMAL TRANSPORTATION 

YOUNG-HEON KIM AND ROBERT J. MCCANN 

Abstract. Let M and M be n-dimensional manifolds equipped with suitable 
Borel probability measures p and p. Ma, Trudinger & Wang gave sufficient 
conditions on a transportation cost c £ C 4 [M xM) to guarantee smoothness of 
the optimal map pushing p forward to p\ the necessity of these conditions was 
deduced by Loeper. The present manuscript shows the form of these conditions 
to be largely dictated by the covariance of the question; it expresses them via 
non-negativity of the sectional curvature of certain null-planes in a novel but 
natural pseudo-Riemannian geometry which the cost c induces on the product 
space M x M. 

Holder continuity of optimal maps was established for rougher mass distri- 
butions by Loeper, still relying on a key result of Trudinger &; Wang which re- 
quired certain structure on the domains and the cost. We go on to develop this 
theory for mass distributions on differentiable manifolds — recovering Loeper's 
Riemannian examples such as the round sphere as particular cases — give a 
direct proof of the key result mentioned above, and revise Loeper's Holder con- 
tinuity argument to make it logically independent of all earlier works, while 
extending it to less restricted geometries and cost functions even for subdo- 
mains M and M of R" . We also give new examples of geometries satisfying the 
hypotheses — obtained using submersions and tensor products — and some 
connections to spacelike Lagrangian submanifolds in symplectic geometry. 



1. Introduction 

Let M and M be Borel subsets of compact separable metric spaces, in which 
their closures are denoted by clM and clM. Suppose M and M are equipped 
with Borel probability measures p and p, and let c : cl(M x M) — > R U {+00} 
be a lower semicontinuous transportation cost defined on the product space. The 
optimal transportation problem of Kantorovich |22j is to find the measure 7 > 
on M x M which achieves the infimum 

(1.1) W c (p,p) := min / c(x, x)dj(x, x). 

l€X{p,p) J MxM 

Here T(p, p) denotes the set of joint probabilities having the same left and right 
marginals as p <g> p. It is not hard to check that this minimum is attained; any 
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minimizing measure 7 £ T(p,p) is then called optimal. Each feasible 7 G T(p,p) 
can be thought of as a weighted relation pairing points x distributed like p with 
points x distributed like p; optimality implies this pairing also minimizes the average 
value of the specified cost c(x, x) of transporting each point x to its destination x. 

The optimal transportation problem of Monge |30j amounts to finding a Borel 
map F : M — ► M, and an optimal measure 7 vanishing outside Graph(i ? ) := 
{(x,x) € M X M I x = F(x)}. When such a map F exists, it is called an optimal 
map between p and p; in this case, the relation 7 is single- valued, so that p-almost 
every point x has a unique partner x = F(x), and optimality can be achieved in 



(1.1 1 without subdividing the mass at such points x between different destinations. 
Although Monge's problem is more subtle to solve than Kantorovich's, when M is a 
smooth manifold and p vanishes on every Lipschitz submanifold of lower dimension, 
a twist condition ((Al), Definition |2.1| below) on the cost function c[x, x) guarantees 
existence and uniqueness of an optimal map F, as well as uniqueness of the optimal 
measure 7; see Gangbo [18] . Gangbo & McCann [19], Carlier [9], Ma, Trudinger & 
Wang [27 , or the reviews in [TD] [JT]. One can then ask about the smoothness of 
the optimal map F : M — > M. 

For Euclidean distance squared c(x, x) — \x — x\ 2 /2, this regularity question was 
resolved using geometric ideas by Caffarelli [S] [B] [7], and also by Delanoe in the 
plane [14] and by Urbas in higher dimensions [36 a , who formulated it as an oblique 
boundary value problem and applied the continuity method with a priori estimates. 
Convexity of M C R" necessarily plays a crucial role. Delanoe investigated reg- 
ularity of optimal transport with respect to Riemannian distance squared on a 
compact manifold [23], but completed his program only for nearly flat manifolds 
[15] . an improvement on Cordero-Erausquin's result from the torus |llj . though his 
criterion for nearness to flat depends on the measures p and p. Under suitable con- 
ditions on p and p and domains M and M C R", Ma, Trudinger & Wang [27] [M] 
developed estimates and a continuity method approach to a class of cost functions 
c e C 4 (M x M) which satisfy a mysterious structure condition comparing third 
and fourth derivatives. Adopting the notation defined in the following section, they 
express this condition in the form 

(1.2) E (~%M + c ir aC ab c m )c^c T h t P 1 qeq f >C\p\ 2 \q\ 2 if p±q, 

for some constant C > and each pair of orthogonal vectors p,q 6 R™. Here 
summation on a, b, e, /, k, I is implicit but the sum on i, j is written explicitly for 
consistency with our later notation. Loeper [26] confirmed their structure condition 
to be necessary for continuity of F, as well as being sufficient for its smoothness [27] 
|34j |35j . Loeper furthermore offered a direct argument giving an explicit Holder 
exponent for F, largely avoiding the continuity method, except that it relied on cen- 
tral results of Delanoe, Loeper, Ma, Trudinger, and Wang, to establish key technical 
lemmata on compact Riemannian manifolds [15 [27 such as the sphere 17], and 
on Euclidean domains 134 . 
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Although the tensorial nature of condition (1.2 1 was known to both Trudinger 
and Loeper, it was not emphasized in their manuscripts Q Moreover, the Euclidean 
structure provides a deceptive identification between vectors with covectors which 



may obscure the geometrical content of (1.2 1, by suggesting that the Euclidean 
orthogonality of two vectors may be relevant, rather than orthogonality of a vector 
and covector pair. On the other hand, the Euclidean metric plays no role in the 
question being addressed, namely the smoothness of maps F : M — ► M which are 
optimal for the cost c. 

Our present purpose is first to extend the theory of Loeper (and in principle, 
of Ma, Trudinger & Wang) to the transportation problem set on a pair of smooth 
manifolds M and M , by finding a manifestly covariant expression of Ma, Trudinger 



& Wang's structure condition (1.2 1, as the sectional curvature non-negativity of 
certain null planes in a pseudo-Riemannian metric on M x M explored here for 
the first time; and second to give an elementary and direct geometrical proof of 
the key ingredients which Loeper requires, Theorems |3.1| and |A.10| below, logi- 
cally independent of the methods and results of Delanoe [15], Delanoe & Loeper 
[T7] , Ma, Trudinger & Wang [57], Trudinger & Wang [33], or their subsequent 
work [35 . This makes Loeper's proof of Holder continuity of optimal maps self- 
contained, including maps minimizing distance squared between mass distributions 
whose Lebesgue densities satisfy bounds above for p(-) and below for p(-) on the 
round sphere M — M = S n . As a byproduct of our approach, we are able to relax 
various geometric hypotheses on M, M and the cost c required in previous works; 
some of these relaxations were also obtained simultaneously and independently by 
Trudinger & Wang using a different approach [35] , which we learned of while this 
paper was still in a preliminary form |25j . 

An important feature of our theory is in its geometric and global nature. In 
combination with our results from |24j . this allows us to extend the conclusions of 
the key Theorems 3.1 and A. 10 to new settings, such as the Riemannian distance 



squared on the product M = M = S" 1 x • • • x S" fc x R' of round spheres, or 
Riemannian submersions thereof. This is a genuine advantage of our work over 



previous approaches [33] [35J [25]; see Example 3.9 and Remark 3.10 

Since terms like c-convex, c-subdiffcrcntial, and notations like d c u are used incon- 
sistently through the literature, and because we wish to recast the entire conceptual 
framework into a pseudo-Riemannian setting, we often depart from the notation and 
terminology developed by Ma, Trudinger, Wang and Loeper. Instead, we have tried 
to make the mathematics accessible to a different readership, by choosing language 
intended to convey the geometrical structure of the problem and its connection to 



1 Personal communications with Neil Trudinger and Gregoire Loeper. This observation is made 
en passant in |33 | , while an assertion and proof of this fact has been included in the revised version 
of 1201 . which was communicated to Villani and recorded in |41| independently of the present work. 
Although Loeper was motivated to call the expression appearing in < | I ■-'[ ) a cost-sectional curvature 
by his discoveries in the Riemannian setting (Example |3,6| , it does not seem to have been noted 
by previous authors that the manifest covariance of the regularity question largely dictates the 
form of its necessary and sufficient answer, by requiring this answer to be expressible in terms of 
coordinate independent quantities such as geodesies and curvature. 
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classical concepts in differential geometry not overly specialized to optimal trans- 
portation or fully nonlinear differential equations. This approach has the advantage 
of inspiring certain intuitions about the problem which are quite distinct from those 
manifested in the previous approaches, and has a structure somewhat reminiscent 
of symplectic or complex geometry. Although we were initially surprised to discover 
that the intrinsic geometry of optimal transportation is pseudo-Ricmannian, with 
hindsight we explain why this must be the case, and make some connections to the 
theory of Lagrangian submanifolds in the concluding remarks of the paper. 

The outline of this paper is as follows. In the next section we introduce the 
pseudo-Riemannian framework and use it to adapt the relevant concepts and struc- 
tures from Ma, Trudinger & Wang's work on Euclidean domains to manifolds whose 
only geometric structure arises from a cost function c : M x M — ► R U {+00}. 
Since Morse theory prevents a smooth cost from satisfying the desired hypothesis 
(Al) on a compact manifold, we deal from the outset with functions which may 
fail to be smooth on a small set — such as the cut-locus in the Riemannian setting 
P5] [25], see Example 3.6 or the diagonal in the reflector antenna problem [ST] [13] 
[5], see Example 3.5 This is followed by Section [3j where we motivate and state 
the main theorem proved here: a version of Loeper's global geometric characteriza- 
tion of (1.2 1 which we call the double-mountain above sliding-mountain maximum 
principle. In the same section we illustrate how this theorem and the pseudo- 
Riemannian framework shed new light on a series of familiar examples from Ma, 
Trudinger, Wang and Loeper, including those discussed above, and new ones formed 
from these by quotients and tensor products of, e.g., round spheres of different sizes, 
in Example 3J5 Section [4] contains the proofs which relate our definitions to theirs 
and establish the main theorem. A level set argument is required to handle the 
more delicate case in which the positivity in (1.2) is not strict. A last section offers 
some perspective on these results and their connection to optimal transportation. 
We have included a series of appendices which give a complete account of Loeper's 
theory of Holder continuity of optimal mappings, illustrating how our main result 
makes this theory self-contained, and simplifying the argument at a few points. 
In particular, we give a unified treatment of the Riemannian sphere and reflector 
antenna problems, using the fact that the mapping is continuous in the former to 
deduce the fact that it avoids the cut-locus [17], instead of the other way around. 

It is a pleasure to thank Neil Trudinger, Xu-Jia Wang, Gregoire Loeper, and 
Cedric Villani for many fruitful discussions, and for providing us with preprints 
of their inspiring works. Useful comments were also provided by John Bland, 
Robert Bryant, Philippe Delanoe, Stamatis Dostoglou, Nassif Ghoussoub, Gerhard 
Huisken, Tom Ilmanen, Peter Michor, Truyen Nguyen, and Micah Warren. We 
thank Jeff Viaclovsky for helpful references, and Brendan Pass for a careful reading 
of the manuscript. We are grateful to the Mathematical Sciences Research Institute 
in Berkeley, where parts of this paper were written, and to Adrian Nachman, Jim 
Colliander, and the 2006-07 participants Fields Analysis Working Group, for the 
stimulating environment which they helped to create. 
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2. PSEUDO-RlEMANNIAN FRAMEWORK 

Fix manifolds M and M which, if not compact, are continuously embedded in 
separable metrizable spaces where their closures clM and clM are compact. Equip 
M and M with Borel probability measures p and p, and a lower semicontinuous 
cost function c : cl(M x M) — ► R U {+00}, and a subdomain N C M x M of 
the product manifold. Visualize the relation N as a multivalued map and denote 
its inverse by N* := {(x,x) \ (x,x) G N}. We call iV(x) := {x G M \ (x,x) G iV} 
the set of destinations visible from x, and N(x) :— {x G M \ (x, x) G iV} the set of 
sources visible from 2. We define the reflected cost c*(x, x) := c(x, x) on M x M. In 
local coordinates x 1 , . . . ,x n on M and x 1 , . . . ,x n on M, we use the notation such 
as Cj = dcfdx 1 and = dcjdx 1 to denote the partial derivatives -Dc = (ci, . . . , c„) 
and Dc = (cj, . . . , c^) of the cost, and = d 2 c/ dx^ dx l to denote the mixed partial 
derivatives, which commute with each other and form the coefficients in the n x n 
matrix DDc. When cq is invertible its inverse matrix will be denoted by c 7 . The 
same notation is used for tensor indices, with repeated indices being summed from 
1 to n (or n + 1 to 2n in the case of barred indices), unless otherwise noted. 

Let T X M and T*M denote the tangent and cotangent spaces to M at x. Since 
the manifold N C M x M has a product structure, its tangent and cotangent spaces 
split canonically: T {x ^)N = T X M © T X M and TT^N = T*M © T|M. For c(x, x) 
sufficiently smooth, this canonical splitting of the one-form dc will be denoted by 
dc = Dc © Dc. Similarly, although the Hessian of c is not uniquely defined until 
a metric has been selected on N, the cross partial derivatives DDc at (a;, x) G N 
define an unambiguous linear map from vectors at x to covectors at x; the adjoint 
(DD C y = DDc of this map takes T X M to T* X M. Thus 

-DDc \ 
-DDc J 

gives a symmetric bilinear form on the tangent space T^ X ^N to the product. Let 
us now adapt the assumptions of Ma, Trudinger & Wang [57] [Mj to manifolds: 
(AO) (Smoothness) c G C 4 (iV). 

Definition 2.1. (Twist condition) A cost c G C 1 (N) is called twisted if 
(Al) for all x G M the map x — ► —Dc{x, x) from N(x) C M to T*M is infective. 
If c is twisted on N C M x M and c*(x,x) = c{x,x) is twisted on N* = {(S, x) \ 
(x,x) G N} we say c is bi-twisted. 

Definition 2.2. (Non-degeneracy) A cost c G C 2 (N) is non-degenerate if 
(A2) for all (x,x) G N the linear map DDc : T X M — > T*M is bijective. 

Though ( Al) will not be needed until the appendices (and must extend to cl N 
there), for suitable probability measures p and p on M and M the twist condition 
alone is enough to guarantee the Kantorovich infimum ( |1.1[ ) is uniquely attained, 
as well as existence of an optimal map F : M — ► M [TH] [3] [57], as reviewed in 
[41] [TO] . It implies the dimension of M cannot exceed that of M, while (A2) forces 
these two dimensions to coincide. The non-degeneracy condition (A2) ensures the 
map x — ► —Dc(x,x) acts as a local diffeomorphism from N(x) C M to a subset 




6 



YOUNG-HEON KIM AND ROBERT J. MCCANN 



of T*M (which becomes global if the cost is twisted, in which case its inverse is 
called the cost- exponential |26| . Definition |4.3| below), and that h(-,-) defined by 



(2.1 1 is a non-degenerate symmetric bilinear form on Tr x ^\N. Although h is not 



positive-definite, it defines a pseudo-Riemannian metric on TV, which might also 
be denoted by di 2 = —cqdx % dxK The signature of this metric is zero, since in 
any choice of coordinates on M and M, the eigenvalues of h come in ±A pairs due 



to the structure (2.1); the corresponding eigenvectors are p © p and (— p) p in 
T(x,x)N = T X M © T S M. Non-degeneracy ensures there are no zero eigenvalues. A 
vector is called null if h(p © p,p © p) = 0. A submanifold £ C N is called null if 
all its tangent vectors are null vectors, and totally geodesic if each geodesic curve 
tangent to S at a point is contained in X locally. The submanifolds {x} x N(x) and 
N(x) x {x} are examples of null submanifolds in this geometry, and will turn out 
to be totally geodesic. Assuming c S C (N), we can use the Riemann curvature 
tensor Ri'j'k'l' induced by h on N to define the sectional curvature of a two-plane 
P AQ at (x,x) G N: 

2n 2n 2n 2n 

(2.2) sec (x , s) P A Q = secgJJ P^Q=J2J2J2Y1 Rvym<P % ' Q 3 ' P k ' Q 1 ' ■ 

i'=l j'=l k' = l l' = l 

In this geometrical framework, we reformulate the mysterious structure condition 



(1.2 1 of Ma, Trudinger & Wang [27] [M] from the Euclidean setting, which was 
necessary for continuity of optimal maps [28] and sufficient for regularity [27] [3"3] . 
The reader is able to recover their condition from ours by computing the Riemann 



curvature tensor (4.2 1. Note that we do not normalize our sectional curvature 
definition (2.2 1 by dividing by the customary quantity h(P, P)h(Q,Q) — h(P,Q) 2 , 
since this quantity vanishes in the case of most interest to us, namely P = p © 
orthogonal to Q = © p, which means p © p is null. 

Definition 2.3. (Regular costs and cross-curvature) A cost c £ C 4 (N) is 

weakly regular on N if it is non- degenerate and for every {x, x) G N . 

(A3w) sec( XjS ) (p © 0) A (0 © p) > for all null vectors p®pG T( X ^N. 

A weakly regular cost function is strictly regular on N if equality in ( A3w) implies 

p = or p = 0. in which case we say (A3s) holds on N. We refer to the quantity 

appearing in as the cross-curvature, and say a weakly regular cost — and the 

pseudo-metric \2. 1\ it induces on N — are non-negatively cross-curved if 

(2.3) sec( X)i ) (p © 0) A (0 © p) > 

holds for all (x, x) G N and p (Bp € Tr XtS \N , not necessarily null. The cost c and 
geometry (TV, h) are said to be positively cross-curved if in addition, equality in 



(2.3) implies p — or p = . 

If clM CC M' and clM CC M' are contained in larger manifolds and (AO), 
(A2) and (A3s/w) all hold on some neighbourhood N' C M' x M' containing 
N CC N' compactly, we say c is strictly/weakly regular on cl7V. If, in addition 
(Al) holds on N' , we say c is twisted on cl N. 

The nullity condition on p © p distinguishes weak regularity of the cost from 
non-negative cross-curvature: this distinction is important in Examples |3.5|and|3.9| 
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among others; see also Trudinger & Wang |34j . Non-negative cross-curvature is in 
turn a weaker condition than sec^ Ar,/l ^ > 0, which means seCf XtS \(p®q) A(q®p) > 
for all (x, x) £ N and p © q, q © p £ T^ ^N. As a consequence of Lemma 4.1 and 
due to the special form of the pseudo-metric, sec^ N ' h ^ > is equivalent to requiring 
non-negativity of the cross-curvature operator as a quadratic form on the vector 
space T X M A T X M, i.e., 

(2.4) fltffcripV ~ '/V'H/'V - q k q l ) > 0. 

Example 2.4 (Strictly convex boundaries). Let il C R ,l+1 and A c R' i+1 be 
bounded convex domains with C 2 -smooth boundaries. Set M — d£l, M = dA, 



and c(x,x) = \x — x\ /2. We claim the pseudo-metric (2.1 1 is non-degenerate and 
that sec (JV '' l) > on N := {(x,x) £ dtt x dA \ n n {x) ■ h A (x) > 0}, where h n (x) 
denotes the outer normal to f2 at x. Indeed, fixing (x,x) £ N, parameterize M 
near x as a graph X £ R" — > (X, f(X)) £ d£l over the hyperplane orthogo- 
nal to nn(x) + ua(x), and M near x by a convex graph X £ R™ — > (A, 5(A)) 
over the same hyperplane. This choice of hyperplane guarantees |V/(A)| < 1 and 
|V<?(A)| < 1 nearby, so in the canonical coordinates and inner product on R", 
(A2)— (A3w) follow from a computation of Ma, Trudinger & Wang [27] which 
yields the cross-curvature 

(2.5) secgWfreO) A (0 (Bp) = fa^tf 9$?) I {* + 2V/ • Vg) > 0. 



In fact, we can also deduce the stronger conclusion sec^ N ' h ^ > as in (2.4 1: 
(2 + 2V/ • Vg) secj^ (p © g) A (g © p) 

= ( P D 2 fp)(pD 2 gp) + (qD 2 fq)(qD 2 gq) - 2(pD 2 fq) (pD 2 gq) 
(2.6) > (y/{pD*fp)($D*gp) - V(qD 2 f<l)(qD 2 gq)) ' . 

Noting n ((A,/(A))) = (V/(A),-1), Ma, Trudinger & Wang's computation 
shows nondegeneracy (A2) fails at the boundary of N where ha(x) ■ h\(x) = 



implies the denominator of (2.5 1 is zero. Gangbo & McCann [20] showed the cost 
is twisted on N provided A is strictly convex, but cannot be twisted on any larger 
domain in M x M. If both f2 and A are 2-uniformly convex, meaning that the Hes- 



sians D f and D g are positive definite, the conditions for equality in ( 2.6 ) show the 
sectional curvature of (N, h) to be positive. The resulting strict regularity (A3s) 
underlies Gangbo & McCann's proof of continuity for each of the multiple map- 
pings which — due to the absence of twisting (Al) — are required to support the 
unique optimizer 7 £ T(p, p) in this geometry. Here the probability measures p and 
p are assumed mutually absolutely continuous with respect to surface measure on 
£1 and A, both having densities bounded away from zero and infinity. For contrast, 
observe in this case that the same computations show that although nondegeneracy 
(A2) also holds on the set A_ := {(x, x) £ dSl x dA \ na(x) ■ n\(x) < 0}, this time 
both surfaces can be expressed locally as graphs over hn{x) — h\{x) but (A3w) 
fails at each point (x,x) £ A_: indeed, the cross-curvatures of A_ are all negative 
because D 2 f > > D 2 g have opposite signs. 
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Let us now exploit the geodesic structure which the pseudo-metric h induces 
on N C M X M to recover Ma, Trudinger & Wang's notions concerning c-convex 
domains |27] in our setting. 

Definition 2.5. (Notions of convexity) A subset W C N c M x M is geodesi- 
cally convex if each pair of points in W is linked by a curve in W satisfying the 
geodesic equation on (N,h). This definition is extended to subsets W C clN by 
allowing geodesies in N which have endpoints on ON. We say Cl C clM appears 
convex from x G M if {x} x Cl is geodesically convex and Cl C clN(x). We say 
W C M x M is vertically convex ifW(x) := {x G M \ (x,x) G W} appears convex 
from x for each x € M . We say Q C cl-M appears convex from x G M iffl x {x} is 
geodesically convex and fl C cl7V(a;). We say W C M x M is horizontally convex 
if W{x) := {x G M | (x, x) S W} appears convex from x for each x G M . If W is 
both vertically and horizontally convex, we say it is bi-convex. 



For a non-degenerate twisted cost (AO)— (A2), Lemma 4.4 shows £1 C N(x) 
appears convex from x if and only if Dc(x,£l) is convex in T*M; similarly for a 
bi-twisted cost f2 C N(x) appears convex from x if and only if Dc(£l, x) is convex in 
T^M. This leads immediately to notions of apparent strict convexity, and apparent 
uniform convexity for such sets, and shows our definition of apparent convexity 
is simply an adaptation to manifolds of the c-convexity and c*-convexity of Ma, 
Trudinger & Wang |27j : is c* -convex in their language with respect to x if it 
appears convex from x; Cl is c-convex with respect x if it appears convex from x; 
and and f2 are c*- and c-convex with respect to each other if N — Q, x Cl C R 2 " 
is bi-convex, meaning Dc(x, N(x)) C T*M and Dc(N(x),x) C T£M are convex 
domains for each (x, x) G N. 

Remark 2.6. In dimension n = 1, strict regularity (A3s) follows vacuously from 
non-degeneracy (A2), since p ® p null implies p — or p = 0. For this reason 
we generally discuss n > 2 hereafter. Note however, for n = 1 and c G C 4 (N) 
non-degenerately twisted, N is bi-convex if and only if N(x) and iV(x) are homeo- 
morphic to intervals. In local coordinates x 1 and x 1 on TV, non-deg enera cy implies 
en = =pe ±A ^ ,:r '. Comparing -cmi + c 11 jc 11 c 1 ij = Axi|cu| with (4.2| shows 



(2.7) c(x 1 ,x 1 ) = T / / e ±x( - s ^dsdt 



induces a pseudo-metric h on N for which secr X)i ) (p © Q) A (0©p) has the same sign 
as d 2 X/dx 1 dx 1 whenever p ^ ^ p. If (AT, ft,) is connected its cross-curvature will 
have therefore a definite sign if A(x, x) is non-degenerate, and a semidefinite sign 
if A(x, x) is twisted. Moreover, the sign of the cross-curvature, sectional curvature, 
and curvature operator all coincide on a product of one-dimensional manifolds, 
although this would not necessarily be true on a product of surfaces or higher- 
dimensional manifolds. 

Definition 2.7. (c-contact set) Given il C clM, u : — ► RU {+oo} ; and c : 
cl(M x M) — > RU {+oo}, we define Dom c := {(x, x) G cl(M x M) \ c(x, x) < oo}, 
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the c-contact set 8qu{x) :— {x € clM \ (x,x) £ 9qu}, and d c u — d^ lM u, where 
(2.8) 8qU :— {(x,x) £ Dome | u(y) +c(y,x) > u(x) +c(x,x) for all y £ ft}. 
We define Dom9^u :— {x £ ft \ d^u(x) ^ 0} and Dom9 c u := Dom<9£ 1M M. 

3. Main Results and Examples 

A basic result of Loeper J2B] states that a cost satisfying (A0)-(A2) on a bi- 
convex domain AT = M X M C R™ x R™ is weakly regular (A3w) if and only 
if d c u(x) appears convex from x for each function u : M — > R U {+00} and 
each x € M. His necessity argument is elementary and direct, but for sufficiency 
he appeals to a result of Trudinger & Wang which required c-boundedness of the 
domains M and M in the original version of [33]. The same authors gave another 
proof of sufficiency for strictly regular costs in [35j , and removed the c-boundedness 
restriction in the subsequent revision of [34. Our main result is a direct proof 
of this sufficiency, found independently but simultaneously with [35 , under even 
weaker conditions on the cost function and domain geometry. In particular, the 



manifolds M and M in Theorem 3.1 need not be equipped with global coordinate 
charts or Riemannian metrics, the open set N C M x M need not have a product 
structure, and the weakly regular cost need neither be twisted nor strictly regular. 
This freedom proves useful in Examples |2.4| and |3.9| and Remark |3.10| 

Theorem 3.1. (Weak regularity connects c-contact sets) Use a cost c : 
cl(M x M) — > R U {+00} with non-degenerate restriction c £ C (JV) to define 



a pseudo-metric (2.1) on a horizontally convex domain N C M X M. Fix C 
cl-M, x £ M, and a set C cliV(x) which appears convex from x. Suppose 
(~^Q<t<iN(x(t)) is dense in for each geodesic t £ [0,1] — ► (x,x(t)) £ {x} x f2, 
and c : cl(iV) — ► R U {+00} is continuous. If c is weakly regular (A3w) on 
N, then Cl n d^u(x) is connected (and in fact appears convex from x) for each 
u : Q, ► R U {+00} with x £ Domu. 

To motivate the proof of this theorem and its relevance to the economics of 
transportation, consider the optimal division of mass p(-) between two target points 



y, z £ M in ratio (1 — e)/e. This corresponds to the minimization (1.1 1 with 
p(-) = (1 — e)Sy(-) + eSg(-). If c(x,x) is the cost of transporting each commodity 
unit from x to x, a price differential A between the fair market value of the same 
commodity at z and y will tend to balance demand e and 1 — e with supply p(-), 
given the relative proximity of z and y to the producers /?(•) distributed throughout 
M. Here proximity is measured by transportation cost. Since each producer will 
sell his commodity at z or y, depending on which of these two options maximizes 
his profit, the economic equilibrium and optimal solution will be given, e.g. as in 
[H)] , by finding the largest A £ R such that 

(3.1) u{x) = max{ A — c(x, y), —c(x, z)} 

yields e < p[{x £ M \ u{x) = —c(x, z)}] . 

Producers in the region {x £ M \ A — c(x, y) > — c(x, z)} will choose to sell their 
commodities at y, while producers in the region where the opposite inequality 
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holds will choose to sell their commodities at z; points xq £ M on the hypersurfacc 
c { x 0iV) ~ c ( x 0tz) — A of equality arc indifferent between the two possible sale 
destinations y and z. We call this hypersurface the valley of indifference, since it 
corresponds to a crease in the graph of the function u. Loeper's observation is that 
for optimal mappings to be continuous, each point xq in the valley of indifference 
between y and z must also be indifferent to a continuous path of points x(t) linking 
y — x(0) to z — x(l); otherwise, he constructs a measure p concentrated near xq for 
which the optimal map to a mollified version of p exhibits a discontinuous jump, 
since arbitrarily close producers will choose to supply very different consumers. 
Indifference means one can choose X(t) such that 

(3.2) u(x) > max X(t) — c(x,x(t)) 

0<t<l 

for all x £ M, with equality at xq for each t £ [0,1]. When the path connecting y 
to z exists, this equality forces X(t) = c(xo,x(t)) — c(xq,z); it also forces the path 



t £ [0,1] — > (xo,x(t)) to be a geodesic for the pseudo-metric (2.1l on (N,h), so 
the path {x(t) | < t < 1} appears convex from xq. 

We think of a function of the form x — > X — c(x, y) as defining the elevation of a 



mountain on M, focused at (or indexed by) y £ M. The function u{x) of (3.1 1 may 



be viewed as a double mountain, while the maximum (3.2 1 may be viewed as the 
upper envelope of a one-parameter family of mountains which slide as their foci x(t) 
move from y to z. The proof of the preceding theorem relies on the fact that the 
sliding mountain stays beneath the double mountain (while remaining tangent to 
it at xo), if the cost is weakly regular. In the applications below, we take £1 = clM 
and Cl = clN(x) = clM tacitly. 



Proof of Theorem 3.1 Let c £ C A (N) be weakly regular on some horizontally con- 
vex domain N C M x M. Fix u : il — ► R n {+00} and x £ Domw with 
y, z £ Cl n 8qu(x). This means u(y) > u{x) — c{y, z) + c(x, z) for all y £ Q,, the 
right hand side takes an unambiguous value in RU {—00}, and the same inequality 
holds with y in place of z. Apparent convexity of Cl from x implies there exists a 
geodesic t £ ]0, 1[ — ► (x, x(t)) in (N, h) with x(t) £ f2 which extends continuously 
to x(0) = y and 5(1) = z. The desired connectivity can be established by proving 
x(t) £ d^u(x) for each t £ ]0, 1[, since this means f2 D 8qu(x) appears convex from 
x. 

Horizontal convexity implies N(x(t)) appears convex from x(t) for each t £ [0, 1], 



so agrees with the illuminated set V{x,x{t)) = N(x(t)) of Definition 4.7 For any 
y £ r\o<t<iN (x(t)) , the sliding mountain lies below the double mountain, i.e., 
f(t,y) ■= -c(y,x(t)) + c(x,x(t)) < max{/(0 + ,?/),/(l",y)}, according to Theorem 



4.10 and Remark 4.12| Note that / : [0, 1] X O — ► RU {—00} is a continuous 



function, since t £ [0,1] — > (x,x(t)) £ clN and c : cl(iV) — ► RU {+00} are 
continuous and their composition is real-valued. We therefore replace + by and 
1~ by 1 and extend the inequality to all y £ il using the density of r\o<t<xN(x(i)). 
On the other hand, x(t) £ d^u(x) holds if and only if u(y) > u(x) + f(t,y) for 
each y £ and t £ [0, 1]. Since u(y) > u(x) + max{/(0, y),f(l, y)} by hypothesis, 
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we have established apparent convexity of fl n d^u(x) from x and the proof is 
complete. □ 

Remark 3.2 (Holder continuity). Since connectedness and apparent convexity sur- 
vive closure, we may replace find^u(x) by its closure (often d c u(x)) without spoiling 
the result. The apparent convexity of d c u(x) from x hints at a kind of monotonic- 
ity for the correspondence x G M — > d c u(x). A strict form of this monotonicity 



can be established when the cost is strictly regular (A3s) (Proposition B.ll, and 
was exploited by Loeper to prove Holder continuity F e C 1 ^ 4n_1 ^(JVf;clM) of 
the optimal map between densities p € L°°(M) and p with (1/p) € L°°(M) for 
costs which are strictly regular and bi-twisted on the closure of a bi-convex do- 
main M x M CC R 2n . Details of his argument and conclusions are given in the 
appendices below. 

Remark 3.3 (On the relevance of twist and apparent convexity to the converse). 
In the absence of the twist condition, we have defined apparent convexity by the 
existence of a geodesic, which need not be extremal or unique. When (A3w) fails in 
this general setting, Loeper's converse argument shows the existence of a geodesic 
segment with endpoints in ({x} x N(x)) n d c u but which departs from this set at 
some points in between. Since the twist condition and apparent convexity imply the 
existence and uniqueness of geodesies linking points in {x} x N(x), for a twisted 



cost the if statement in Theorem 3.1 becomes necessary as well as sufficient, a 



possibility which was partly anticipated in Ma, Trudinger & Wang [27 . 

Remark 3.4 (Product Domains). If N = M x M the hypotheses and conclusions 
become simpler to state because N(x) = M and N(x) = M for each (x, x) E N. If, 
in addition the product N = M x M C R 2n is a bounded Euclidean domain, we 
recover the result proved by Loeper [26] based on the regularity results of Trudinger 
& Wang [34], whose hypotheses were relaxed after [35] . 

Example 3.5 (The reflector antenna and conformal geometry). The restriction 
of the cost function c(x,x) = — \og\x — x\ from R™ x R n to the unit sphere 
M = M = S' 1 - 1 := <9B?(0) arises in conjunction 05] gT] with the reflector an- 
tenna problem studied by Caffarelli, Glimm, Guan, Gutierrez, Huang, Kochengin, 
Marder, Newman, Oliker, Waltmann, Wang, and Wescott, among others. It induces 
a metric h known to satisfy (A0)-(A3s) with N = (M x M) \ A. Note however, 
that c(x, x) actually defines a pseudo-metric /j/ 00 * 00 ) on the larger space R™ x R™\ A 
which is almost but not quite bi-convex. Here A := {(y, y) \ y 6 R n U{oo}} denotes 
the diagonal. For fixed a ^ a £ R™, 

(X,X). 2 i °g| a ,_ 5 |2| a _ i |2 

induces a pseudo-metric h^ a ' a ^ which coincides with ft,( 00 ' 00 ) on the set where both 
can be defined. Moreover, c ( - a,a ^ extends smoothly to (M a xM a )\A where M = R"U 
{oo} is the Riemann sphere and M a = M\{a}. Furthermore hS a ^ is independent of 
(a, a), so has a (unique) extension h to N := (M x M) \ A which turns out to satisfy 
(A2)-(A3s) on the bi-convex set N; as in [2B] [27] [33], this can be verified using 
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the alternate characterization of (A3w/s) via concavity (/ 2-uniform concavity) of 
the restriction of the function 

(3.3) q* ET*M > eg (x, C-Ex P:E q* ) 



to the nullspace of p G T X M in Domc-Exp^,; here the c-Exp map is defined at (4.3 1. 
The nullspace condition is crucial, since the value of this function is given by 

(3.4) w fa\w)-H- q *) = mp 1 ) 2 - wvi a > 

where the left-hand expression in (|3.4| coincides with the right-hand expression in 



(3.3 1 for general costs of the form c(x, x) — f(x — x). Homogeneity of the resulting 



manifold (N, h) follows from the symmetries N — [F x F)(N) and 

c (a ' a) (x,x) = c^' F ^(F{x),F(x)) 

under simultaneous translation F(x) — x — y by y G R™ or inversion F(x) — 
x/\x\ 2 of both factor manifolds; note the identity \x — x\ = |a:||a;|||x| _2 a; — |S|~ 2 5;|. 
This simplifies the verification of (A2)-(A3s), since it means infinity plays no 
distinguished role. Bi-convexity of N follows from the fact that the projection of 
the null geodesic through (z, x) and (y, x) onto M is given by a portion of the 
circle in R" U {oo} passing through z,y and x — the unique arc of this circle (or 
line) stretching from z to y which does not pass through x. From homogeneity 
it suffices to compute this geodesic in the case [z,x) = (oo,0); we may further 
take y — (1, 0, . . . , 0) using invariance under simultaneous rotations F{x) — Ax by 
A G 0(n) and dilations F(x) — Xx by A > 0. This calculation demonstrates that 
^o<t<iN(x(t)) is the complement of a circular arc — hence a dense subset of M 
- for each geodesic t € [0,1] — ► (x,x(t)) £ N. Notice that if x, x(0), x(l) all lie 
on the sphere M = M, then so does x(t) for each t E [0, 1]. After verifying that h 
coincides with the restriction of h to the codimcnsion-2 submanifold N, we infer for 
x G <9B™(0) that N(x) x {x} is a totally geodesic hypersurface in N(x) x {x}, so the 
horizontal and vertical geodesies, bi-convexity, and strict regularity of (N, h) are 
inherited directly from the geometry of (N , h), the strict regularity via Lemma 



4.5 



Although we lack a globally defined smooth cost on N, we have one on N, so 



the hypotheses and hence the conclusions of Theorems 3.1 and 4.10 are directly 
established in the reflector antenna problem, whose geometry is also clarified: the 
vertical geodesies are products of points on M with circles on M, where by circle we 
mean the intersection of a two-dimensional plane with M — <9B"(0). The geodesies 
would be the same for the negation c+(x, y) — +\og\x — y\ of this cost, which 
satisfies (A0)-(A2) on the bi-convex domain N, but violates (A3w) for the same 
reason that c satisfies (A3s). Using the Euclidean norm on our coordinates, the c- 



exponential (4.3l is given by x = c-Exp a ,(q*) := x — q*/\q*\ , the optimal map takes 



the form F(x) = c-Fxp x Du{x) 1 and the resulting degenerate elliptic Monge- Ampere 



type equation (5.1 1— ( 5.2 ) on R™, expressed in coordinates, is 



(3.5) det[uij(x) + 2ui{x)uj{x) — 5ij\Du(x)\ 



21 _ \Du(x)\ 2n p(x) 

p(x- Du(x)/\Du(x)\ 2 )' 
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Here p(x) = 0(|x| _2n ) as x — ► oo since p transforms like an n-form under coordinate 
changes. Using the isometries above to make Du(xq) — at a point xq of interest, 
a slight perturbation of the standard Monge- Ampere equation is recovered nearby. 
The operator under the determinant (3.5) is proportional to the Schoutcn tensor 
of a conformably flat metric ds 2 = e~ 4u X^Li^ 2 -') 2 ; so a similar equation occurs in 
Viaclovsky's <7„-version of the Yamabe problem [38] [39], which has been studied 
by the many authors in conformal geometry surveyed in |37j and [33j . 

Example 3.6 (Ricmannian manifolds). Consider a Ricmannian manifold (M = 
M, g). Taking the cost function to be the square of the geodesic distance c(x, x) — 
d 2 (x,x)/2 associated to g, induces a pseudo-metric tensor (2.1) on the domain N 
where c(x, x) is smooth, i.e. the complement of the cut locus. Moreover, the cost 
exponential (Definition |4.3[ ) reduces |29j to the Riemannian exponential 

(3.6) c-Exp^p* = exp x p 

with the metrical identification p* = g(p,-) of tangent and cotangent space. A 
curve t £ [0, 1] — > x(t) £ N(x) through a; is a geodesic in (M, g) if and only if 
the curve r{t) = (x(t),x) is a (null) geodesic in (N,h), according to Lemma 



4.4 



On the diagonal x — x, we compute h(p ®p, p (Bp) — g(p,p), meaning the pscudo- 
Riemannian space ( N, h) contains an isometric copy of the Riemannian space (M, g) 
along its diagonal A :— {(x,x) \ x £ M}. The symmetry c{x,x) — c(x, x) shows 
A to be embedded in N as a totally geodesic submanifold, and nullity of p p £ 
Tt XiX \N reduces to orthogonality of p with p. This perspective illuminates Loeper's 
observation |26j that negativity of one Riemannian sectional curvature at any point 
on (M, g) violates weak regularity (A3w) of the cost. Indeed, the comparison of 
( |4.9| with Lemma 4.5 allows us to recover the fact that along the diagonal, cross- 
curvatures in (TV, h) are proportional to Ricmannian curvatures in (M, g) : 

(3.7) sec^J (p © 0) A (0 ffi p) = | sec^^ p A p. 

Example 3.7 (The round sphere). In the case of the sphere M = M = S™ equipped 
with the standard round metric, the cut locus consists of pairs (x, x) of antipodal 
points d(x,x) — DiamM. Denote its complement by N = {(x,x) \ d(x,x) < 
DiamM}, where c(x,x) = d 2 (x,x)/2 is smooth. In this case, the identification 
(3.6 1 of cost exponential with Ricmannian exponential implies bi-convexity of N, 
since the cut locus forms a circle (hypersphere if n > 2) in the tangent space T x S n , 
and the verification of ( A3s) both on and off the diagonal was carried out by Loeper 
|26j . In fact more is true: (TV, h) is non-negatively cross-curved, as we verify in |24j . 
Given an /i-geodesic t £ [0,1] — > (x,x(t)) £ N, we find (lo<t<iN(x(t)) exhausts 
S" except for the antipodal curve to the exponential image t £ [0, 1] — ► x(t) of a 
line segment in T x S n . Although this curve does not generally lie on a great circle 
or even a circle, its complement is dense in S™, whence the double-mountain above 
sliding-mountain property and Theorem |3 . 1 1 follow . 

Example 3.8 (Positive versus negative curvature). To provide more insight into 
Loeper's results — continuity of optimal maps on the round sphere versus discontin- 
uous optimal maps on the saddle or hyperbolic plane consider dividing a smooth 
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positive density p (say uniform on some disk of volume X^=i e i — I) optimally be- 
tween three points x\, X2, S3 on a geodesic through its center: p(-) = JD i=1 eife;(')- 
The solution to this problem is given, e.g. [19], by finding constants Ai, A2, A3 G R 
for which the function 



(3.8) u(a;) 



max {iti (a;)} given by Uj(a;) : 

Ki<3 



solves £j = ppi] with 



Ai - c(x,Xi) 
f2, := {x 6 M I = Uj(x)} 



for i = 1,2,3. The regions f2j are illustrated in Figure 1 for the case where cost 
c(x,x) = d 2 {x,x)/2 is proportional to Riemannian distance squared on the (a) 
round sphere, (b) Euclidean plane, (c) hyperbolic disc, 
(a) (b) (c) 





Figure 1 

Notice that only in the third case is one of the regions fi 2 disconnected. Now 
consider transporting a smoothly smeared and positive approximation p v {-) of the 
discrete source p(-), obtained by mollifying on scale r\ > 0, to the uniform mea- 
sure p(-). For rj > sufficiently small, points x near a; 2 will need to map into a 
5- neighbourhood of f^- It is not hard to believe the results of Loeper's calculation 
for case (c), namely that if r\ > is small the map from p v (-) to p(-) will have a dis- 
continuity near x^ separating the regions which map into disjoint (^-neighbourhoods 
separating the two disconnected components of f^- 

Example 3.9 (New examples from old: perturbations, submersions, and products). 
It remains interesting to find more general sufficient conditions on a Riemannian 
manifold (M, g) and function / for the pseudo-metric h induced on the complement 
of the cut locus N c M x M by c(x, x) = f(d(x, x)) to be strictly or weakly regular. 
It is clear that slight perturbations of an (A3s) cost remain strictly regular if the 
perturbation is smooth and small enough. Dclanoe & Ge have quantified this obser- 
vation on the round sphere |16| . It is also possible to deduce that the local properties 
such as (A3s/w) and non-negative cross-curvature and the global property such 



as the conclusion of Theorem 3.1 all survive Riemannian submersion [24], holding 



for example on quotients of the round sphere under discrete |16] |24] or continuous 
[24 group actions, including in particular all spaces of constant positive curvature 
[TB] . and the Fubini-Study metric on CP™ [23]. On the other hand, an example 
constructed by one of us shows that positive but non-constant sectional curvature 
of the underlying manifold (M = M, g) does not guarantee weak regularity of the 
cost c — d 2 /2 away from the diagonal in N C M x M [23]. As a final important 
example, consider two manifolds N + C M + x M + and N_ C M_ x M_ equipped 
with cost functions c± € C 4 (N±) inducing pseudo-metrics d£± = —c^dx^dx 3 ^. As 
a consequence of Lemma 4.5 the product metric d£ 2 = d£ 2 + + di 2 _ corresponding 
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to the cost function c + (x + ,x + ) + c-(x-,X-) on N = {M+ x M_) x (M + x M_) 



is non-negatively cross-curved (2.3) if both (N±,h±) are. Although it is not true 



that weak regularity of the factors implies the same for the product, many of the 



known examples of weakly regular costs (including those of Examples 2.4 3.7 and 
the sumbersions above) actually turn out to be non-negatively cross-curved |24j , so 
this product construction becomes a fruitful source of new examples. Furthermore, 
since geodesies in the product are products of geodesies, bi-convexity of the factors 
(N±,h±) implies bi-convexity of the product manifold (N,h). Because a product 
geodesic may have constant factors, it is not hard to show that the non-negatively 
curved product manifolds (N, h) always fail to be strictly regular even when both 
factor manifolds (N±, h±) are positively cross-curved. Thus tensor products of pos- 
itively cross-curved costs yield a new source of weakly regular costs that fail to be 
strictly regular — the very simplest example of which is given by arbitrary sums 
c(x, x) — X)fc=i c { x > x i k) of k = 1, . . . , n positively cross-curved costs c(s, t; k) as 



in (2.7 1 on bi-convex subdomains Ni, . . . , N n C R 



2 



Remark 3.10. (Products S™ 1 x • • • x S™* x R ( and their Riemannian submersions) 
The conclusion of Theorem |3 . 1 1 holds for the distance squared cost on the Riemann- 
ian product M = M of round spheres S™ 1 x ■ ■ • x S" fc x R' — or its Riemannian 
submersions — by combining the preceding example with the result of [24 . The 
weak regularity (A3w) of the cost on N = M x M\{cut locus} and the biconvexity 



of N are satisfied as in Example 3.9 The density condition of Ho<t<i A(x(t)) follows 
also easily since the cut locus of one point in this example is a smooth submanifold 
of codimension greater than or equal to 2. This new global result illustrates an 
advantage of our method over other approaches [31] [35] [53], which would require 
a regularity result for optimal maps (or some a priori estimates) to obtain the con- 



clusion of Theorem 3.1 To implement such approaches for the manifolds of this 
example, one would need to establish that an optimal map remains uniformly away 
from the cut locus, as is currently known only for a single sphere M = M = S n 



from work of Delanoe & Loeper [T7] (alternately [5B] or Corollary E.2|. To the 
best of our knowledge, no one has yet succeeded in establishing regularity results 
for this product example, though one could try to obtain them from Theorems |3.1 
and |A.10| by extending the approach we develop successfully in Appendix [E] for a 
single sphere M — M = S n . 



4. Proof of Main Results 
Let us begin by establishing coordinate representations of the Christoffcl symbols 



and Riemann curvature tensor for the pseudo-metric (2.1) 



Lemma 4.1. (Riemann curvature tensor and Christoffel symbols) Use a 

non-degenerate cost c £ C A (N) to define a pseudo-metric (2.1) on the domain 
N C M x M . In local coordinates x 1 , . 
non-vanishing Christoffel symbols are 



,x n on M and x , . . . , x n on M , the only 



(4.1) 



P m — r n 



c ki 



and 



Y— m — r " 



c ki 
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Furthermore, the components of the Riemann curvature tensor (2.2) vanish except 
when the number of barred and unbarred indices is equal, in which case the value of 
the component can be inferred from RijM = and 

(4-2) %Rijkl = c ijkl ~ C liJ^ C ajk- 

Proof. From 

r y OT : = ^"^(^M + ^ ife J ~ hi^ k ) + -h mh {h kji + h ikj - h ijk ), 

and the analogous definition with and/or m replaced by i,j and m respectively, 
the off-diagonal form (2.1 1 of the pseudo- metric and the equality of mixed partials 
implies the only non- vanishing Christoffcl symbols are given by 

^ij" 1 = — ^C m ^( — Cjyj — Cjjy ; + 0) 
— c c kij 

and ri, = c mk c k ij. Since the only nonvanishing Christoffcl symbols are given by 
(4.1|, it is not hard to compute the relevant components of Ricmann's curvature 
tensor in the coordinates we have chosen: 

/,',,,'" = - i "; + 9 i"/ + t{ t% - i f i + r{ i - r; i ": 

l 3 k Q x i jk Q x j ik ki jf kj 1} ki jf k] if 

d 

= (r ma r —\ 

Q x i ^ C ajk) 

— r" lh r — -4- r™ h r -r a f r — 
— c c ijkb' c c bif c c ajk- 



Using the pseudo-metric (2.1l to lower indices yields (4.2 1, and the other non- 
vanishing components of the Riemann tensor can then be deduced from the well- 
known symmetries —RjiU — Rffla = Rku] = ~^ikil- To see that the remaining 
components all vanish, it suffices to repeat the analysis starting from the analogous 
definitions of R^f 1 , R ljk m , and R. l3k " 1 . □ 



Remark 4.2 (Vanishing curvatures). The vanishing of Rijki, Rij k f, Rijkh ^kjkh an d 

, . . — n imnlv that ( A 2 TM\ Oi ( /\ 2 TM\ li'oa In ihn mill 

V/A 



Rijkl = imply that {h 2 TM) © (A 2 TM) lies in the null space of curvature operator 



viewed as a quadratic form on A 2 TN = (A 2 TM) © (A 2 TM) © (TM A TM). Strict / 
weak regularity of the cost, and the signs of the cross-curvature (|2.3|, and sectional 



curvature (2.4 1 are therefore all determined by the action of this operator on the n 
dimensional vector bundle TM A TM. Since the cone of null vectors is nonlinear, 
it is not obvious whether (A3w) implies non-negativity of all cross-curvatures of 
(N, h), but Trudinger & Wang .84] prove, as in Example 3.5 that this is not so. 

We next recall an important map of Ma, Trudinger & Wang [27], called the 
cost- exponential by Loeper |26j . 

Definition 4.3. (Cost exponential) If c G C 2 (N) is twisted (Al), we define the 
c-exponcntial on 

Dom(c-Exp x ) := — Dc(x, N(x)) 

(4.3) ={p* e T*M | p* = -Dc(x, x) for some x € N(x)} 
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by c-Exp x p* — x if p* — —Dc(x,x). Non-degeneracy (A2) then implies the c- 
exponential is a diffeomorphism from ~Dom(c-Exp x ) C T*M onto N(x) C M. If 
c G C 2 {N) is non-degenerate but not twisted and q* = —Dc(x,y), the implicit 
function theorem implies a single-valued branch of c-Exp x taking values near y is 
defined by the same formula in a small neighbourhood of q* , though it no longer 
extends to be a global diffeomorphism ofDom(c-Exp x ) onto N(x). 

Lemma 4.4. (the c-segments of [27 are geodesies) Use a non-degenerate cost 
c G C 4 (-/V) to define a pseudo-metric (2.1) on the domain N C M x M. Fix 
x G M. If p*,q* G T)om(c-Exp x ) G T*M are close enough together there will be 
a branch of c-Exp x defined on the line segment joining p* to q* . Then the curve 

s G [0,1] ► cr(s) := (x,c-Exp x ((l — s)p* + sq*)) is an affinely parameterized 

null geodesic in (N,h). Conversely, every geodesic segment in the totally geodesic 
submanifold {x} x N(x) can be parameterized locally in this way. 

Proof. Given So G [0, 1], introduce coordinates on M and M around ct(so) so that 
nearby, the curve a(s) can be represented in the form (x , . . . , x n , x (s), . . . , x n (s)). 
Differentiating the definition of the cost exponential 

(4.4) = (1 - s)p* + sq* + Ci (a(s)) 
twice with respect to s yields 

(4.5) = c t] x k + c, ,,,■<■• -v k 

for each i — 1, . . . , n. Multiplying by the inverse matrix c rm to cq yields 

(4.6) = £" l + c" ll c l - fk x 1 x k , 

always summing on repeated indices. Together with x m = = x m , we claim these 
reduce to the geodesic equations 

(4.7) = x m +T lJ m x l x 1 +T Jj m xh 1 +T lJ m x i x ] +T rj m x 1 x ] 

o = x m + r ij m x l x J +r- ij n x 1 ^ + x % x^ + x l x^ 

on (N, h). Indeed, this follows since the only non- vanishing Christoffel symbols are 
given by (4.1|. Comparing (4.6) with (4.7 1 , we see <r(s) is an affinely parameterized 
geodesic near a(so); it is null since h(a, a) — from the off-diagonal form of (2.1 1. 

Conversely, any geodesic segment in (N, h) which lies in x x N(x) can be param- 
eterized affinely on s G [0, 1]. Near s G [0: 1] it then satisfies ( |4.5[ ), whence 

(4.8) 0=^ Ci (x,x(s)). 

Integrating twice, the constants of integration determine p* and q* G T*M such 
that (4.4) holds locally. Thus (1 — s )p* + s q* G Dom(c-Exp :E ). Choosing a branch 
of the cost exponential defined near this point and equalling Dic(x, x(sq)) there, 
we deduce (x,x(s)) = c-Exp 2 ,((l — s)p* + sq*) for s near So from the definition of 
this branch. 

Finally, to see that {x} x N(x) is totally geodesic, take any point x G N(x) 
and tangent vector p G T X M. Setting x m (s) = x m to be constant solves half of 
the geodesic equations, since TV^ = = TVi. we can still solve the remaining n 
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components of the geodesic equation (4.6) for small s E R, subject to the initial 



conditions a;(0) = x and x(0) — q, to find a geodesic which remains in the n- 
dimensional submanifold {x} x N(x) for short times. □ 

The next lemma gives a non-tensorial expression of the sectional curvature in 
our pseudo-Riemannian geometry (N,h). In the context of Example 3.6 it can 



be viewed as a generalization of the asymptotic formula for the Riemannian dis- 
tance between two arclength parameterized geodesies x(s) and x{t) near a point of 
intersection x(0) = x(0) at angle 9: 

(4.9) " ' ' • ■ ' h 



d 2 (x(s),x(t)) = s* +i z -2st cos 9- -s 2 t 2 sin 2 9 + 0{{s 2 + t 2 f /2 ) 

3 



where the Riemannian curvature k of the 2-plane i(0) A x(0) on [M = M,g) 
gives the leading order correction to the law of cosines. Though we do not need it 
here, the proof of the next lemma can also be adapted to establish an expansion 



analogous to (4.9 1 for general costs c(x(s), x(t)); the zeroth and first order terms 
do not vanish, but the coefficients of s 2 t and s 3 t are zero due to the geodesy of 
s £ [0, 1] — ► u(s). Remarkably however, to determine the coefficient of s 2 t 2 in the 
lemma below requires only one (in fact, cither one) and not both of the two curves 
to be geodesic. 

Lemma 4.5. (Non-tensorial expression for curvature) Use a non-degenerate 
cost c € C 4 (7V) to define a pseudo-metric (2.1) on the domain N C M X M. Let 
(s,t) € [—1,1] — > (x(s),x(t)) 6 N be a surface containing two curves a(s) — 
(x(s),x(0)) and r(t) = (x(0),x(t)) through (x(0),x(0)). Note © x(0) defines a 
parallel vector-field along <r(s). If s € [— 1,1] — ► er(s) £ N is a geodesic in (N,h) 
then 



(4.10) 



d 4 



ds 2 dt 2 



c(x(s),x(t)) 



s=0=t 



da dr 
sec (x(0)M0)) -A-. 



Proof. Introduce coordinates x , 



in a neighbourhood of x(0) on M and 



x 1 , . . . ,x n in a neighbourhood of x(0) on M, so the surface (x(s),x(t)) € N has 
coordinates (x 1 (s), . . . ,x n (s),x 1 (t), . . . ,x n (t)) locally. To see © 5(0) defines a 
parallel vector field along a(s), we use the Levi-Civita connection to compute 



b l \lix k + d-*V?±* = x l 



dx 
dx l 



+ x l T^x J + iTf,i J + = 



since the only non- vanishing Christoffcl symbols are given by (4.1 
Computing the fourth mixed derivative yields 



(4.11) 



ds 2 dt 2 



c(x(s),x(t)) 



s=0=t 



■ i • j • k ' I i -a ' k • I i / - ' i ' j i _--a\--b 

c ijkl x X X X -t C akl X X X + {C^jjX X ~t C ab X )X 

( c iklj ~ c Ma cah c bij)X % & X k X 1 



where the form (4.6 1 of the geodesic equation has been used t o elim inat e the coef- 
ficient of x b and express x a in terms of x 1 . Comparing (4.111 with (4.2 1 and (2.2) 
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yields the desired conclusion (4.101. The minus sign comes from antisymmetry 



Rikij — —RikjT °f the Riemann tensor. □ 



Our next contribution culminates in Theorem |4.10| which generalizes the result 
that Loeper deduced from Trudinger & Wang [34] . As mentioned above, it 
can be interpreted to mean that if a weakly regular function c £ C 4 (N) governs 
the cost of transporting a commodity from the locations where it is produced to 
the locations where it is consumed, a shipper indifferent between transporting the 
commodity from y to the consumer at either endpoint x(0) and x(l) of the geodesic 
t £ [0, 1] — ► (y,x(t)) in (N, h), will also be indifferent to transporting goods from 
y to the consumers at each of the intermediate points x(t) along this geodesic. As 
was also mentioned, for non-degenerate costs Loeper showed this conclusion fails 
unless the cost is weakly regular. 

Proposition 4.6. (Maximum principle) Use a weakly regular cost c £ C 4 (N) 



to define a pseudo-metric {2.1) on the domain N C M X M. Given M, 
let t G]0,1[ — > (x,x(t)) £ N be a geodesic in {N,h) with 2(1/2) ^ and set 
/(t) — —c(y,x(t)) + c(x,x(t)). If f(to) = for some t €]0,1[ with a geodesic 
linking (x,x(to)) to (y,x(to)) lying in N(x(to)) x {x(to)}, then f(to) > 0. Strict 
inequality holds if the relevant cross-curvature of c is positive at some point on the 
second geodesic. 

Proof. Suppose f(t) has a critical point at some < t < 1, so 

(4.12) 0= /(t ) =(d- i c(x,x(to))-d- i c(y,x(t )))x 1 (t ); 

we then claim /(to) > 0. 

Let s S [0, 1] — > (x(s),x(to)) be a geodesic in N(x(to)) x {x(t )} with endpoints 



x(0) = x and x(l) = y. Setting g(s,t) = — c(x(s), x(t)) + c(x,x(t)) 1 Lemma 4.5 
yields 



(4.13) 



ds 2 dt 2 



= \ sec {x{s)Mto)) {x{s) © 0) A (0 © x(t )) > 0, 

(s,to) 1 



with the inequality following from weak regularity (A3w) of the cost c € C 4 (iV), as 
long as i(s)©0 2 ( to ) and O x ( s )(Bx(to) are orthogonal vectors on (N,h), or equivalently, 
as long as x(s) © i(to) is null. These vectors are non-vanishing since x(0) ^ a;(l) 



and x(0) ^ 0; we now deduce their orthogonality from (4.121, using subscripts to 
distinguish which tangent space the zero vectors reside in. 

Along the geodesic s £ [0,1] — > (x(s) , x(to)) , the vector field X ( S ) © i(to) is 
parallel transported according to Lemma |4.5| Thus the inner product A of this 
vector field with the tangent vector is independent of s £ [0,1]. Define q|(s) :— 



c\c(x(s),x(t )) £ T*, to) M, so q?(s) = djdic{x{s), x(t ))i J (s). From the form (2.1) 
of the pseudo-metric we discover 



A = h(0 x{s) ®x(t ),x(s)®0 x(to) ) 
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Integrating this constant over < s < 1, (4.12 1 yields the desired orthogonality 



A = 4 i (i )(tf(0) -<£(!)) = 0. 



Now (4.131 shows d g/dt \ t= t a to be a convex function of s £ [0,1]. We shall 
prove this convex function is minimized at s — 0, where it vanishes. Introducing 
coordinates x , . . . , x n around x = x(0) on M and x , . . . ,x n around x(to) on M, 
we compute 



8*9 
dt 2 

d 3 g 



dsdt 2 



(s,*o) 



(a, to) 



(a, to) 
(Q,*o) 



cj(x(s), x(t))x l + qj(cc(s), x(t))x l x J 

(cr ik (x(s),x(t ))x l + Cl- lk (x(s),x(t ))x l X J )x k . 



When s = 0, the last line vanishes by the geodesic equation for t € [0, 1] — ► 
(x(0), i(t)), and the preceding line is manifestly zero. Thus the strictly convex 
function d g/dt \t—t must be nonnegative for s £ [0,1] and, as initially claimed, 
/(*o) = d 2 .9/<9i 2 | (M)=(Mo) is non-negative at any t <E ]0, 1[ where f(t ) = 0. 

If the relevant cross-curvature of the cost is positive at one point (x(sq), x(to)), 



then the non-negative function d 2 g/dt 2 \ t= t is strictly convex (4.131 on an interval 



I 2 g/dt 2 

at s = 1, to conclude the proof. 



around sq S [0, 1]; since d g/dt \t=t is minimized at s 



it must then be positive 
□ 



Definition 4.7. (Illuminated set) Given (x,x) £ N, let V(x,x) c M denote 
those points y £ N(x) for which there exists a curve from (x, x) to (y, x) in N(x) x 
{x} satisfying the geodesic equation on (N,h). 



As a warm up to the double mountain above sliding mountain Theorem |4.10| let 
us derive a strong version of this result under the simplifying hypothesis that the 
cost is strictly (A3s) and not merely weakly regular (A3w). 

Corollary 4.8. (Strict maximum principle) Use a strictly regular cost c £ 
C (N) to define a pseudo-metric (2.1) on the domain N C M x M. Let t £ 
[0,1] — ► (x,x(t)) £ N be a geodesic in (N,h) with x(0) ^ 0. Then for all 
y £ n tG [o i i]V(x, x(t)) the function f{t) = —c{y,x{t)) + c(x,x(t)) satisfies f(t) < 
max{/(0),/(l)} onO < t < 1. 

Proof. Let t £ [0, 1] — ► (x, x(t)) £ N be a geodesic in (N, h) with x(0) ± 0. Given 
y £ r\ te [ ^i]V(x, x(t)) and set f(t) = —c(y, x~(t)) + c(x, x(t)). Notice y, x £ N(x(t)) 
so / is C 4 smooth on [0,1]. Proposition 



4.6 



asserts /(to) > at each interior critical 
point /(io) — 0. Any critical point of / in ]0, 1[ is therefore a local minimum, and 
/ is strictly monotone away from this point. Thus fit) < max{/(0), /(l)} for 
< t < 1, as desired. □ 

If N is horizontally convex then V(x,x(t)) = N(x(t)), which motivates the 
relation of this corollary to Theorem |3.1| Let us now show a weak version of this 
maximum principle survives as long as the cost is weakly regular. To handle this 
relaxation we use a level set approach. 
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Lemma 4.9. (Level set evolution) Let g 6 C 2 (]e,e[x U) where U C R" is 
open. Suppose Dg — (dig, . . . ,d n g) is non-vanishing on ]e, e[ x U . Then the zero 
set S(t) — {x S U | g(t,x) — 0} is a C 2 -smooth n — 1 dimensional submanifold of 
U which can be parameterized locally for small enough t by {X(t,z) \ z G 5(0)}, 
where the Lagrangian variable X(t,x) solves the ordinary differential equation 



(4.14) 



dX(t,x) 



dg Dg 
dt \Dg\< 



(t,X(t,x)) 



subject to the initial condition X(t,x) — x. Moreover, the positivity set S + (t) — 
{net/ g{t, x) > 0} has S(t) as its boundary, and expands with an outward normal 
velocity given by 



(4.15) 



5X 

Ik 



Dg 



(t, 



\Dg\ 



Og/dt 



\Dg\ 



(t,X(t,z)) 



(t,X(t,z)) 

Proof. Clearly the boundary of S + (t) is contained in S(t). Since Dg ^ 0, the 
implicit function theorem implies S(t) is a C 2 -smooth hypersurface and separates 
regions where g(t,x) takes opposite signs. Thus S(t) is contained in and hence 
equal to the boundary in U of the positivity set. If the desired parametrization 
exists it must satisfy = g(t,X(t, z)). Differentiation in time yields an equation 



0=^(t,X(t,z)) + Dg(t,X(t,z)) 



Ik 



(M) 



easily seen to be equivalent to (4.14 1. Conversely, near a point (0, z) £ ]— e, e[x 5(0), 
the C 1 vector field (4.141 can be integrated for a short time (depending on z) to 
yield the desired parametrization. □ 

Theorem 4.10. (Double mountain above sliding mountain) Use a weakly 
regular cost c E C A (N) to define a pseudo-metric (2.1) on the domain N C M x M . 
Let t G]0, 1[ — ► (x,x(t)) £ N be a geodesic in (N,h). If ]to,ti[ x {y} lies in the 
interior of 



(4.16) 



A:={(t,y)e [0,1] xM\yeV(x,x(t))}, 



then f(t) = -c(y,x{t)) + c(x,x(t)) < max{/(£+), f(t^)} on < t < * < h < 1, 
where 



(4.17) 



lim f(t 

e\0 



/(*r 



lim/(t-e). 



Proof. Fix a geodesic t €]0,1[ — > (x,x(t)) G N with x(l/2) ^ 0, since otherwise 
the conclusion is obvious. Note that c-Exp^, and hence t € ]0,1[ — > x(t) are C 3 
smooth, from Lemma [4. 4| and (AO). For all y 6 M and t € ]0, 1[ set 

f{t, V) = - c (y, ®(t)) + c(x, x(t)) 
and note that f(t,y) is C 3 -smooth on the interior of A C [0, 1] x M. Define 



5+ 



{(t,y)eintA| ^>0} 
5 := {(t,y)€intA| g = 0}. 
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For each t € ]0, 1[, we think of f(t,y) as defining the elevation of a landscape over 
M, which evolves from f(y,0) to f(y, 1) as t increases, and is normalized so that 
f(x,t) — 0. We picture f(y,t) as a sliding mountain, with S + (t) :— {y £ M \ 
(t, y) £ S + } denoting the rising region, and S(t) := {y 6 M \ (t, y) 6 S} the region 
at the boundary of S + (t) which — instantaneously — is neither rising nor sinking. 
We claim the rising region S + (t) cMisa non-decreasing function of t £ ]to, t\ [. 



To see this, we plan to apply Lemma 4.9 to the C function 



(4.18) g(t, y)~-^ = ~Dc(y, x(t))x(t) + Dc(x, x(t))x(t) 

on A. Differentiating this function with respect to y £ M yields 

Dg{t,y) = -DDc(y,x(t))x(t) 

¥= o 



which is non- vanishing because of (A2). Applying Lemma 4.9 on any coordinate 
chart in M shows S(t) is the boundary of S + (t), and the question of whether S + (t) 
is expanding or contracting along its boundary is determined by the sign of dg/dt 
on S{t). 

From the definition of V(x, x(t')) observe y £ S(t') C V(x, x(t')) implies (y, x(t')) 
is linked to (x,x(t')) by a curve in N(x(t')) x {x(t')} which is geodesic in (N,h). 



Proposition 4.6 asserts d 2 f /dt 2 \( t , y ) > 0, so from Lemma 4.9 there is a neighbour- 
hood of y on which the rising region S + (i) does not shrink for a short time interval 
around t' e]0, 1[. 

Finally, fix y £ M and < to < t\ < 1 such that ]i 07 *i [ x {y} C intA and let 
U C ]to, ti[ denote the open set of times at which y ^ S + (t). If U is non-empty, we 
claim any connected component of U has t = to in its closure. If not, let t' £ ]t ,ti[ 
denote the left endpoint of a connected component in U. This means y £ S + (t') 
but y ^ S + (t / + S) for any 6 > 0, in violation of the non-shrinking property of 
S + (t) derived above. Thus t £ )t.Q,tx[ — ► f(t,y) is decreasing on an interval U — 
}to,t(y)[ for some t(y) £ [to,ti] and non-decreasing on the complementary interval 
]t(y),ti[. The limits (4.17) exist and the proof that f(t,y) < max{/(t f ), /(t^)} is 



complete. □ 



Remark 4.11 (A geodesic hypersurface bounds the rising region). Lemma 4.4 im- 
plies S(t) x {x(t)} is a totally geodesic submanifold for each t e]0, 1[ of the preceding 
proof. Indeed, (4.181 shows q* £ Dom(c*-Exp 2( - t - 1 ) C T*^M belongs to the hyper- 
plane (Dc(x, x(t)) + q*)x(t) = 0, if and only if y := c*-'Exp s ^q* lies on S(t). 

Remark 4.12. If the domain N is horizontally convex then V(x, x) — N{x) and A 



from (4.161 are open sets. Then ]0, l[x{y} £ A if and only if y £ no <t< iN(x(t)). 



Remark 4.13 (Enhancements). Villani has incorporated a version of our proof into 
[4"T] . adding simplifications which allow him to avoid the use of the level set method. 
He further develops our technique to prove additional results: contrast our Theorem 
|A.10[ which gives a new approach to a result of Loeper, with Villani 's Theorem 
12.41, which gives a new approach to a result of Trudingcr & Wang. 
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5. Perspective and conclusions 
Before concluding this paper, let us briefly review the connection of the trans- 



portation problem ( 1.1 1 we study with fully non- linear partial differential equations. 
Although this connection goes almost back to Monge [5U] , it has developed dramat- 
ically since the work of Brenier [3] . When an optimal mapping F : M — ► M exists 
and happens to be a diffcomorphism, it provides a change of variables between 
(M,p) and (M,p), hence its Jacobian satisfies the equation 

(5.1) p(F(x))\detDF(x)\ = p(x). 



Often when F is not smooth, (5.1l remains true almost everywhere [35] [H] [T]. 

Optimality implies that F(-) can be related to pair of scalar functions u : M — > 
R U {+00} and u : M — > R U {+00}, which arise from the linear program dual 
to ( |1.1[ ), and represent Lagrange multipliers for the prescribed densities p and p. 
Moreover, u can be taken to belong to the class of c-convex functions, meaning it can 



be obtained as the upper envelope of a family of mountains — analogous to ( 3.1 1 and 



(3.8 1 but with i ranging over p-almost all of M; similarly, u is a c*-convex function, 



as in Definition |A.3| below. When the twist condition holds, F = c-Exp o Du. If 



(5.2) det[u %j + Clj ] [x ^^ VscDu{x)) - n,.(„\\ I det c isl(x,c-Ex Pa! -D«(x)); 



non-degeneracy (A2) also holds, then (5.1 ) becomes an equation of Monge- Ampere 
type expressed as in [S7J using local coordinates around (x 1 , F(x')) by 

p(c-Jixp x Du(x)) 1 

c-convexity of u implies non-negative definiteness of the matrix Uij + of second 
derivatives, hence the equation is degenerate elliptic [TS] . Without further assump- 
tions on (M, p) and the geometry of (M, p) we can expect neither strict ellipticity 
nor regularity of solutions: if the support of p is connected but the support of p is 
not, this will force F to be discontinuous and u C 1 (A/). The equation is there- 
fore not locally smoothing, and the best one can hope is for solutions to inherit 
regularity from the boundary data (M,p) and (M,p). 



For the cost function c(x,x) = \x — x\ 2 /2 on M,M C R", (5.2 1 becomes the 
familiar Monge- Ampere equation for the convex function u(x) + \x\ 2 /2 [3]. In this 
case Caffarelli [5] was able to show convexity of M implies local Holder continuity 
of F if the densities dp/dvol G L°°(M) and dvol/dp £ L°°(M) are bounded, and 
smoothness of F on the interior of M if the densities log | dp/dvol | and log \ dp/dvo\ | 
are bounded and smooth; see also Delanoe [2] for the case n = 2, and Wang [12] 
for analogous results in the context of Example |3.5| the reflector antenna problem. 
The results of Loeper and Ma, Trudinger & Wang extend the Holder [26] and 
smooth [27] [35] theories to general bi-twisted, strictly regular costs on cl(M x M). 
Loeper in particular achieves stronger results such as a global Holder estimate 
with explicit exponent under weaker restrictions on p and p by exploiting strict 



regularity of the cost; see Theorems D.l and E.3 Trudinger & Wang extended 



the up-to-the-boundary regularity results of Caffarelli [4] and Urbas [36] — which 
require convexity and smoothness of M C R™ as well as M — to bi-twisted costs 
which are merely weakly regular [34 . For bounded and sufficiently smooth densities, 
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horizontal and vertical convexity of N — M x M takes the place of the convexity 
assumptions on the target M and domain M in these theories. 

Since we had not previously encountered pseudo-Riemannian geometry of any 
signature other than the Lorentzian one (n, 1) in applications, much less as the nec- 
essary and sufficient condition for degenerate elliptic partial differential equations 
to possess smooth solutions, a few words of explanation seem appropriate^] 

The regularity of optimal maps F : M — > M is a question whose answer should 
depend only on the cost function c{x,x) and the probability measures p and p. It 
should not depend on which choice of smooth coordinates on M and M are used to 
represent this data or the solution. Any necessary and sufficient condition on c(x, x) 
guaranteeing regularity of F should therefore be geometrically invariant, meaning 
coordinate independent. Thus pseudo-Riemannian geometry and curvatures arise 
in the theory of optimal transportation for the same reason they arise in Einstein's 
theory of gravity, general relativity: they provide the natural language for describing 
phenomena — in this case regularity — which exhibit invariance under the general 
group of diffeomorphisms. Put another way, the underlying physical reality is 
independent of whose coordinates are used to describe it. Moreover, this invariance 
places severe restrictions on the form which necessary and sufficient conditions for 
regularity can take: since the pseudo-metric h is equivalent to knowing the cost 
function c(x,x) — up to null Lagrangians v(x) + v(x) which are irrelevant to the 



optimization at hand (1.1) — any such conditions on the cost function must be 
expressible via the curvature tensor of h. 

Let us turn now to the question of why the intrinsic geometry of optimal trans- 
portation should be pseudo-Riemannian rather than Riemannian. Dimensional 
symmetry between the domain M and target M suggest that the number of time- 
like directions in the theory — if any — should equal the number of space-like 
directions. But why signature rather than the signature (2n,0), which is 

more frequently associated to elliptic and extremal problems? And why the nullity 
of p0p in (A3w)? 

The Riemannian notion of length allows us to associate a magnitude to any 
sectional curvature. However, null vectors have no length and cannot be normalized; 
because the plane (p © 0) A (0 © p) is generated by orthogonal null- vectors, we can 
decide the sign (positive, negative, or zero) of its sectional curvature, but not the 
magnitude. On the other hand, the results of Loeper reveal that the size of the 



constant C > in hypothesis (1.2 1 controls the Holder constant of the mapping 
F : M — > M . Unlike the exponent, which is coordinate-independent, the Holder 
constant of F obviously depends on the choice of coordinates. We are therefore 
relieved to find the cross-curvature condition governing regularity does not have an 



2 However, as we learned subsequently from Robert Bryant, the wedge product u>Au> can be used 
to define a signature (3, 3) pseudo-metric on the space R 4 A R 4 ; in four dimensions, the difference 
between positive sectional curvature and positive curvature operator amounts to the question of 
whether the curvature operator is positive-definite only on the light cone with respect to this 
pseudo-metric, or on the full space; c.f. [2]- In one way this parallels the distinction between strict 
regularity and positive cross-curvature of a cost; in another it parallels the distinction between 



positive cross-curvature and positive sectional curvature, I 2.3 I — i2.4i 
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associated magnitude, since the problem has no intrinsic length scale. To be scale 
free, the geometrical structure which governs regularity for optimal transportation 
must be pseudo-Riemannian, since the modulus of continuity of a map F has no 
intrinsic meaning in the absence of separate notions of length on M and M, which 
the cost function c(x, x) alone cannot provide. What it can and does provide 
are geodesies on N(x) x {x} and {x} x N(x), and geodesic convexity of these null 
submanifolds is the essential domain hypothesis in Ma, Trudinger & Wang's theory. 
In the present investigation, we have focused exclusively on the pseudo-metric 



dt = ~c i jdx l dx J induced by the cost (2.1). Let us conclude by noting that there 



is also a canonical symplectic form u> = d(Dc ® 0) = —d(0 © Dc) on N C M x 
M associated to the cost c £ C 4 (N). In local coordinates x 1 ,. . . ,x n on M and 
x 1 , . . . , x n on M it is given by 

1/0 DDc 
(5 - 3) U : = 2 { -DDc 

It is possible to verify that any c-optimal diffeomorphism F : M — > M has a graph 
which is spacclike with respect to h and Lagrangian with respect to uj. Conversely, 
for a weakly regular cost, results of Trudinger & Wang [34] or Villani [41] can be used 
to deduce that any diffeomorphism whose graph is /i-spacelike and w-Lagrangian 
is in fact the c-optimal map between the measures p := Tr^(vol^ N ' h ^ |craph(F)) an d 
p := tt #(yo\^ N |Graph(F)) obtained by projecting the Ricmannian volume vov N ' h ' 
induced by h on Graph(i ? ) through the canonical projections ir(x,x) — x and 
n(x,x) = x. This reveals another unexpected connection between optimal trans- 
portation and symplectic (or pseudo-Kahler) geometry. When p and p are given 
by the Euclidean volumes on two convex domains, and c(x,x) = \x — x\ 2 /2, this is 
related to the work of Wolfson [45] and Warren [44] on special Lagrangian subman- 
ifolds, where a pseudo-Riemannian metric of signature (n, n) also appears (44] . We 
defer the details of this development to a future work. 



Appendix A. Regularity consequences for optimal transportation 

This series of appendices is devoted to explaining how the results obtained above 
combine with Loeper's ideas to simplify the proof of his Holder continuity results for 
optimal mappings with respect to cost functions which are strictly regular (A3s) — 
meaning they have positive cost-sectional curvature in the language of Loeper 26J . 
To emphasize that this argument is completely self-contained, we recall the neces- 
sary details of his proof in full, exploiting where possible the conceptual framework 
developed above. We often present variations on his arguments, but do not claim 
originality for the conclusions. 

The plan can be outlined as follows. In the present section we recall the relevant 
facts of life from the literature, and derive the key local-implies-global ingredient 
- a variant of which appears in Trudinger & Wang |35] — as a simple corollary 
to our main theorem above. In Appendix [5] we recall Loeper's Taylor expansion, 
which uses strict positivity of orthogonal cross-curvatures to quantify the altitude 
of the double mountain over the sliding mountain near the point in the valley of 
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indifference where the sliding mountains are normalized to coincide. Appendices [C] 
and [D] explain how Loeper used this result to obtain Holder continuity of optimal 
mappings between suitable measures on Euclidean domains. Appendix |E| employs 
the same argument in a simpler setting to obtain continuity of optimal mappings 
between suitable measures on the round sphere, when optimality is measured ei- 
ther against (a) Ricmannian distance squared [26 , or (b) the reflector antenna cost 
function, as in [26 and Caffarelli, Gutierrez & Huang [5]. As a corollary, we recover 
directly a bound on how closely an optimal map may approach the cut locus, which 
can be used in place of Delanoe & Loeper's estimate [T7J in the next step. Once the 
continuity of the optimal map has been established, Caffarelli, Gutierrez, Huang 
and Loeper's Holder regularity results concerning optimal maps on the spheres (a) 
[2"rJ] and (b) [5] [25] can be recovered using bi-twisting (Al) alone without further 
resort to the local-implies-global theorem: the problem is easily localized when con- 
tinuity is known. This approach is considerably more direct than Loeper's original 
argument, which established the local-implies-global theorem using approximation 
by smooth solutions to a family of auxiliary problems constructed by combining 
Delanoe's continuity method [TS] with Delanoe & Loeper's control on proximity to 
the cut locus [T7] , and the a priori estimates of Ma, Trudinger & Wang [57] . 

A.f . c-convex functions and their properties. The definition of c-convex func- 
tions and some basic properties are discussed. We first introduce the notion of a 
supporting mountain (or c-mountain), also called c-support functions, c-planes or 
c-graphs. The focus (or c-focus) of such a mountain is defined below, and was called 
the c-normal by Trudinger & Wang, in analogy with the Gauss map and generalized 
normal in the theory of convex bodies. 

We shall find it convenient to continue to assume M and M are open manifolds 
whose closures cl M and cl M are compact in some larger space, while JVcMx Al 
intersects neither M x DM nor (dM) x Al. 

Definition A.l. (Supporting mountains) Fix c : cl(M x Al) — > RU {+oo}. 
A mountain refers to any function f on clM of the form 

/(•) = -<•,*) + A 

with (A,x) S R x clM. The mountain is said to be focused at x. The mountain f 
is said to support u : clM — ► RU {+00} at x € clM if u(x) — f{x) < +00 and 

u(y) > f(y) 

for all y S clAf. The mountain is said to support u to first order at x £ Al if 
u{x) — f{x) < +00, and M has a manifold structure near x with 

(A.l) u(y)> f(y) + o(\y-x\) as y -> x. 

Remark A. 2 (Twisting identifies focus) . If a mountain / supports u to first order at 
a point x € Al where u happens to be differentiable, and / is known to be focused 
in N(x), then twisting (Al) identifies the focus x = c-^xp x Du(x) uniquely, since 
( AT} then implies Du{x) — Df(x) — —Dc(x,x). 
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Definition A. 3. (c-convex function focused in Cl) Fix c : cl(M x M) — ► 
RU {+cxd} and a subset Cl C cl M . Then u : cl M — ► RU {+00} belongs to S^~ c if 
Domit := {x £ M \ u(x) < +00} is non-empty and there exists v : Cl — > RU{+oo} 



such that u = v%, where 



vfi(x) := sup —c(x, x) — v(x). 

2Gfi 



Functions in the class S^~ c are sometimes called c-convex, though they might also 
be called (—c)-convex, since they are supremal convolutions with — c. Our subscript 
keeps track of the set Cl of foci; we say u G S^ c is a c-convex function focused in Cl. 

Lemma A. 4. (Properties c-convex functions inherit from the cost) Fix 

metric spaces (M,d) and [M,d), a cost c : cl(M x M) — ► RU {+00} and subset 
Cl C cl M. Recall c*{x,x) :— c(x, x) . 

(a) Legendre-type duality: If u e S^ c then u — (u^ M )%. 

(b) Semicontinuity: If c : clM x Cl — > R is continuous and u S then u : 
c\M — > RU {+00} is lower semicontinuous and d c u C cl(M x M) is closed. 

(c) Lipschitz: If f2 C cl M and u £ S^ c then 

Lip(u,fi):= sup -^f ^ < supLip(-c(-,x),n). 

QByjizen a(y,z) sen 

(d) Semiconvex: If (M,d) = (R",| • |) and D 2 c(-,x) < CI holds in the (matrix) 
sense of distributions on a domain Cl C R™, with C — C(fl,Cl) € R independent of 
x € Cl, then each u £ S^ c satisfies D 2 u > —CI in the sense of distributions on SI. 

Proof. All four facts are well-known; (b) is elementary to check and the other three 
statements are proved, e.g., in [IT], where (a) is Proposition 5.8, and (c) and (d) 
are established in the proof of Theorem 10.26. □ 

Corollary A. 5. (Lipschitz/semiconvex) If c is a locally Lipschitz function on 
the (compact) closure ofcl(M x M), then u S is locally Lipschitz on clM and 

clM C Dom<9 c u. //, in addition, c is locally semiconcave, then u G is locally 

semiconvex (A. 2) on M . 

Proof. The fact that u G <5~^ inherits Lipschitz or semiconvexity properties from 
the cost follows directly from Lemma A. 4 c) and (d), using compactness of clM to 
conclude that the local semiconcavity constant C of c(-,x) and Lipschitz constant 
Lip(— c(-, x), Q) are independent of x G clM. To see clM C Dom9 c u, use Lemma 



|A.4[ a) to write 

u(x) = sup ~c(x,x) — u^. lM (x). 



2 Gel M 



Here u c clM is lower semicontinuous by Lemma A.4'b) so the supremum is attained 
at some x £ clM, whence 



u(x) + c(x,x) =-u c clM {x) 
= — sup — 

yedM 

< u(y) + c(y,x) 



sup ~c{y,x)-u{y) 

yGclM 
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for all y G clM. Thus (x,x) G d c u as desired I pTs] ). □ 

Note that c is both locally Lipschitz and semiconcave if it is weakly regular on 
cl(N), or else if c = d 2 /2 is the distance squared on a compact Riemannian manifold 
(M = M, without boundary as in Example 3.6 see Cordero-Erausquin, McCann, 
Schmuckenschlager [13] . This is not the case for the reflector antenna cost function 
of Example |3. 5 1 but the associated c-convex functions are Lipschitz and semiconvex 
anyways — unless they are mountains — as the following proposition shows. Here, 
as above, semiconvexity of u : M — > R means that near each point z G M there is 
a coordinate ball B r (z) C M and coordinate-dependent constant C < 00 such that 
D 2 u(x) > — Cl as matrices distributionally in these coordinates, or equivalently 
that 

(A.2) xeB r (z) — > u{x) + C\x\ 2 /2 

is a convex function, where |x| denotes the Euclidean norm of the coordinates. 
Semiconcavity of u means — u is semiconvex. 

Proposition A. 6. (Semiconvexity of reflector antenna potentials) Fix the 

cost c(x,x) = — log \x — x\ on the Euclidean unit sphere S" = <9£>i(0) C R™ +1 . If 

u : S™ ► R U {+00} belongs to S^Z and takes finite values at two or more points, 

then u is locally lipschitz and semiconvex on S™. In coordinates, its Lipschitz and 
semiconvexity constants do not depend on u except through maxs" u — minsn u. 

Proof. Since u € 5g„ c , defining the dual potential u(-) = sup^gg^ log | • —x\ — u(x) 
yields 

(A. 3) u(-) = sup log I • —x\ — u(x) 

x£S™ 

similarly to Lemma A.4[ a). We shall show that both u and u are bounded from 
below and above. Since u(-) takes finite values at xq ^ X\ in S™, we have u(-) > 
max i=0 .i l°g I ' — u(xi) > const > —00, where the constant depends only on 
max{u(xo),u(xi)} and \xo~ Xi\. Thus u(-) is bounded below, hence u(-) is bounded 
above from ( |A.3 ) since \x — x\ < 2 for all x, x G S™. 

On the other hand, if Domu := {x S S™ | u(x) < 00} consists of one or fewer 
points y, we reach the contradiction u(y) = —00. So Domfi consists of two or 
more points, and the same argument as before yields u(-) bounded below and u(-) 
bounded above. 

To show u is semiconvex, recall that any supremum of smooth functions which 
have locally uniform control on their C 2 norms is semiconvex, as in Lemma A.4'd). 
Let —x denote the antipodal point to a fixed point ieS". If | A — Ax | < A, the 
function 

v(-) := max{log | • -x\ - A ,log | • +x\ - AJ 
is semiconvex on S", with a semiconvexity constant C depending only on A < +00, 
but independent of Ao, Ai G R and x G S". Setting A = (sup s „ u) — (infsn u), we 
see from (A. 3 1 that 

w(-) = sup maxjlog | • —x\ — u(x), log | • +x\ — u(— x)} 
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is a suprcmum of such functions v(-), hence semiconvex with constant C as de- 
sired. Finally, any bounded convex function is locally Lipschitz, hence bounded 



semiconvex functions are locally Lipschitz by (A.2|. □ 



Definition A. 7. (Super- and subdifferential) The subdifferential du C T*M 
of a function u : M — ► R U {+00} on a smooth manifold is defined to consist of 
those points (x,q*) £ T*M in the cotangent bundle such that in local coordinates 
around x, 

(A. 4) u{y) > u{x) + q*(y — x) + o(\y — x\) as y — ► x. 

It follows that du(x) := {q* £ T*M \ (x,q*) £ du} is a convex set, and du(x) = 
{Du(x)} at those points x £ DomDn C M \ dM where u is diffcrcntiablc. The 
superdifferential —d(—u) consists of those pairs (x,q*) £ T*M which satisfy the 
opposite inequality. We set Domdu := {x £ M \ du{x) ^ 0}, so that DomDu = 
(Dom9u)n(Domi9(— u)). If a function u on M is locally Lipschitz and semiconvex, 
then du C T*M is closed and M C Domchx. 

Let us now hypothesize an extension of the twist condition to mountains focused 
in cl(M) as follows. 

Definition A.8. (Extended twist) A cost c : cl(M x M) — > RU{+oo} satisfies 
the extended twist condition (Al') if each mountain f is subdifferentiable on M n 
Dom/, where Dom/ := {y £ c\M \ f(y) < 00}, and if there is a continuous 
map c-Exp : T*M n cl(Domc--EKp) — ► clM, such that each x £ M, mountain 
/(■) = —c{-,x) + X, and extreme point p* of df{x) satisfy (x,p*) £ cl(Domc-Exp) 
and x = c-Exp' x p* . 

One way to satisfy (Al') is to assume (Al) extends to some larger manifold N in 
which M x M CC N is compactly embedded, which we abbreviate by saying (Al) 
holds on cl N. Property ( Al') is also verified for the examples discussed hereafter: 
any compact Riemannian manifold (M — M , g) with the geodesic distance squared 



cost c = d 2 /2 of Example 3.6 and the reflector antenna cost c(x, x) = — log \x— x\ on 
the sphere S" x S" as in Example 1 3. 5 1 The key to verification in the Riemannian case 
is that the Riemannian exponential is continuously defined throughout the tangent 
bundle, the cut-locus has no interior, and mountains are semiconvex |13j . so extreme 
points in df(x) of a mountain / are limits of gradients p* — lim^oo Df(xk) from 
nearby points of differentiability Xk £ Domfl/; c.f. §25.6 [32 . 

The extended twist condition (Al') yields the following lemma and theorem. 

Lemma A. 9. (Tangent mountains support c-convex functions globally) 

Suppose a cost c : cl(M x M) — ► RU {+00} twisted on N C M x M satisfies the 
extended twist hypothesis (Al'). If u £ is differentiable at x £ M n Domi9 c u 

then (x,Du(x)) £ clDomc-Exp and d°u(x) = {x} with x := c-Exp' x Du{x) . Any 
mountain supporting u to first order at x is focused at x and supports u globally. 

Proof. Fix a point x £ M <~) Dom<9 c u where u £ <S~[j£f ^ s differentiable. Then 
u(x) < +00 and there exists x £ d c u(x) such that the mountain /(•) = — c(-, x) + A 
with A = c{x, x) + u{x) supports u at x. Now let <?(•) = — c(-, y) + fi be any other 
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mountain which supports u to first order at x. Then g(y) < u{y) + o(\y — x\) = 
u(x) + Du{x){y— x)+o(\y— x\) is superdifferentiable at x; since g is subdifferentiable 
by hypothesis, we conclude Dg(x) — Du(x). In this case dg(x) = {Du(x)}, has 
Du(x) as an extreme point, so the extended twist hypothesis implies g is focused 
at y = c-Hxp' x Du(x). Applying the same argument to / instead of g yields x = 
c-Fixp' x Du(x), hence x, — y and g = f + const. The constant vanishes since g(x) — 
f(x), so we conclude any mountain g which supports u to first order at x E Dom Du 
actually supports u globally. □ 

We are now in a position to extend this lemma to points where u is not diffcr- 
entiable by applying Theorem |3.1| This extension is the key local-implies-global 
ingredient required for Loeper's argument. Our proof of Theorem |A . 1 0| is similar to 
the proof of Proposition 2.12 [ZB] . Under the stronger hypothesis (A3s), Trudingcr 
& Wang's corrigendum contains a different approach to a similar result [35j (but 
with strictly regular costs and a stronger hypothesis on the domains) . 

Theorem A. 10. (Mountains supporting to first-order support globally) 

Use a continuous cost c : cl(M x M) — > RU{+oo} which has a twisted and weakly 



regular (AO)— (A3w) restriction c E C i (N) to define a pseudo-metric (2.1) on a 
horizontally convex domain N C M x M. If c is unbounded assume that whenever 
{xk,Xk) G Dome converges and c(xk,Xk) —> +oo then 

(A. 5) lim \Dc{xk, Xk)\ = +oo. 

k — >oo 

Fix x € M such that clM appears convex from x, and suppose Ho<t<i-/V (x(t)) is 
dense in M for each geodesic t S [0, 1] — ► (x,x(t)) € {x} x clM. Fix u £ ^iSd 
for which there is a coordinate neighbourhood U C M of x contained in Dom d c u, 
and a constant C < oo such that each mountain f : clM — ► R which supports u 
to first order at some y £ U satisfies |V/| < C and D 2 f > — Cl distributionally 
in the given coordinates on U . If the extended twist hypothesis (Al') holds, any 
mountain which supports u to first order at x will be dominated by u throughout 
clM. 

Proof. Take u E <5T^ and x E U C Dom<9 c u \ dM as in the theorem. Assume 



x Dom Du, since otherwise the conclusion follows immediately from Lemma A. 9 
We see u is Lipschitz and semiconvex on U by Lemma |A.4[ after noting that u agrees 
throughout U with the suprcmum of those mountains / < u which support u on 
M and make contact with u on U C Dom3 c u. The subdifferential du(x) C T*A1 
is a non-empty convex set from the semiconvexity of u; it is bounded since u is 
Lipschitz. Let us first show that if p* is an extreme point of du(x) then c-Exp^p* E 
d c u(x). Extrcmality implies x is the limit of a sequence x^ E U D DomDu such 
that p* = lim^oo Du{xk), according to §25.6 |32j . Since d c u{xk) is non-empty, 



Lemma A. 9 yields a sequence of mountains /&(•) = — c(-,Xk) + c(xk,Xk) + u(xk) 
supporting u and focused at points Xk = c-¥ixp' Xk Du(xk) which converge to a limit 
x := c-Exp^,p*. Since \Dc(xk, Xk)\ is bounded by the Lipschitz constant C of u on 



U, (A. 5 1 implies c{xj~,Xk) has a finite limit. It then follows from u(-) > /fc(-) that 



/(•) = — c(-, x) + c(x, x) + u(x) supports u on clM. 
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According to Theorem 3.1 the set d c u(x) of foci corresponding to mountains 
h(-) = — c(-, z) + c(x, z) + u(ir) which support w at i appears convex from x. Thus 
there is a geodesic in N connecting (x,x) to (x,z). From the twist condition, this 
means set of cotangent vectors q* such that c-Exp^g* £ d c u(x) form a convex set 
in T*M. This convex set contains du(x), since it contains the extreme points of 
du(x). Finally, suppose a mountain g(-) = — c(-,y) + /i supports u to first order at 
x. The semiconvexity of g(-) yields some q* extremal in dg{x) with y — c-Exp^,<7*. 



Moreover, q* G du(x) a fortiori (A.ll, hence there is a mountain / also focused at 
y which supports u at x. Now g = f since their foci and values at x coincide, so 
the theorem is established. □ 

Remark A.ll (the sphere, the reflector antenna). Although the hypotheses of Theo- 
rem [XtU] appear complicated, they are obviously satisfied on any product manifold 
N = M x M on which (A0)-(A3w) extend to a larger manifold N DD N which 
contains N compactly. For the distance squared cost c=d 2 /2ona compact and 
boundaryless Riemannian manifold (M — M,g), the cost function is globally Lip- 
schitz and semiconvex [15] . hence all mountains /(•) = —d 2 (-,y)/2 + \i obey the 
desired local estimates. Taking N C M x M to be the complement of the cut 
locus, the only delicacies are bi-convexity of N and the density of lHo<t<i N(x(t)); 
on the round sphere these two properties are satisfied, as Loeper knew and we dis- 
cuss elsewhere. On S™ x S", with the reflector antenna cost c(x, x) — — log \x — x\ 
of Example |3.5| the mountains are smooth away from their foci, so choosing the 



diameter of U small relative to the Lipschitz constant C of u, (A. 5 1 combines with 
Proposition |A.6| to force all mountains which support u to first order in U to have 
foci well outside of U. 

Remark A. 12 (products S" 1 x ■ ■ ■ x S™* x R' and their Riemannian submersions). 
Theorem A.10| holds for the distance squared cost on the Riemannian product 



M = M of round spheres S" 1 x • • • x S" fc x H, 1 or its Riemannian submersions. The 
necessary hypotheses are satisfied according to Example |3.9| and Remark |3.10| 



Appendix B. Estimating the height of the double mountain relative 

TO THE SLIDING MOUNTAIN NEAR THEIR POINT OF COINCIDENCE 

The following local estimate is the main result of this section; due to Loeper 
[26], it is crucial to his argument for Holder continuity of optimal mappings. One 
can view it as a localized version of the double mountain above sliding mountain 



principle (Theorem 4.101, in which strict regularity of the cost is used to quantify 
the relative altitude of the double mountain over the sliding mountain in a neigh- 
bourhood of the point where they coincide. For this entire section, we shall work 
in a single Euclidean coordinate patch; the norm \p\ 2 = p ■ p and derivatives such 
as D 2 c — {cij} and DDc = {cfj} refer to these coordinates. 

Proposition B.l. (Local double-above-sliding- mountain estimate [55]) Let 

c 6 C 4 (cl N) be strictly regular (A2)— (A3s) on the closure of product N = M X M 
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of two bounded domains M, M C R n , so that the pseudo-metric (2.1) obeys 
(B.l) sec (x , s) (p © 0) A (0 (Bp) > 2C \p\ 2 \p\ 2 

for each (x,x) G N and null vector p © p € T^ X ^N , the norms \p\ and \p\ being 
defined using the Euclidean inner product on the global coordinates. Then there 
exist positive constants r<j — ro(DiamAf, 1 1 [Z5Z?c] — 1 1] c° ( jv) : ||c||c 4 (jv)j Co) and C\ — 
Co||2I)5c||^,o(7v)ll[-^ ) -^ ,c ] 1 llco(Af) suc h that: for each coordinate ball B r (x) C At of 
radius < r < r and geodesic t € [0,1] — ► (x,x(t)) € TV with x(0) ^ 0, the 
function f(t,y) := —c(y,x(t)) + c{x, x{t)) satisfies 

max[/(0, y),f(l,y)] > f(t, y) + C x t(l - t)\x(l) - x(0)| 2 |y - x\ 2 - \\c\\ CHN) \y - x\ 3 
throughout (t,y) 6 [0, 1] x B r (x). 

Proof. Our proof diverges from Loeper's [26 , but has the advantage of producing 
a radius r > independent of t g [0, 1]. 

Let D and D 2 denote the gradient and the Hessian operators with respect to the 
fixed Euclidean coordinates. For any fixed w € T X M, Lemma |4. 5| asserts 

^[D 2 f(t,x)](w,w) = ^ 

whence 



(B.2) ^[D 2 f(t,x)](w,w) = ^sec {xMt)) (w®0) A (0 «(*)), 



alt 2 



[D 2 f(t,x)](w,w) 



< ^V| 2 |^)| 2 sup \R m {x,x{t))\. 



from (2.2), and the Cauchy-Schwarz inequality (X)"=i \ wl \) 2 < n|w| 2 . 

On the other hand, setting p*(t) :— Df(t,x) = —Dc(x,x(t)) yields the linear 
parameterization p*(t) = p*(0) + tq* as in ( |4.4| . From this, it is easy to see that 
w © x{t) is an /i-null vector at (x, x(t)) if and only if q*w l — where 



(B.3) 



dt 



(t)=-c i j(x,x(t))x 1 (t). 



Note q* ^ because x(0) ^ 0. For any vector w in the nullspace of q* , (B.l I and 
(pOl imply 



dt 2 



{D 2 f(t,x)}(w,w)>C \w\ 2 \x(t)\ 2 



To make these estimates independent of t, note (B.3 1 combines with nondc- 
generacy (A2) of the cost to imply that x(t) and q* have comparable Euclidean 
magnitudes 

1 i^<i^t)i5ni[^ric°wi<?*i. 



(B.4) 



\DDc\\ c o (N ) 



It follows that 



(B.5) 



|S(1)- 5(0)| < / \x(r)\d 



^III^c]- 1 ! 



c°(N)\q 
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Combining these estimates, we see 
12 t/j. _Mr.„ «„\ . 





d 2 


(B.6) | 


dt 2 
d 2 




dt 2 


where 




(B.7) 




(B.8) 





X)}(W,W) < C[\w\ 2 \q*\ 2 
^[D 2 f(t,x)](w,w) >C' \w\ 2 \q*\ 2 



for general w 

for each w with q*w l = 



C'^C /\\DDc\\ 2 c0(N) , 

C[ = \n 2 \\[DDc]- x f CO{N) s_upjE iM ||c°(iV)- 



In the following, we use (B.6) and a Taylor expansion argument. We work in 
a small ball B r {x) C M oi x, whose radius r shall be determined in the course 
of the proof; note that the condition B r (x) C M is used only to have the Taylor 
expansion. Consider a thin cone 



Kg := {v E T x n 



cos 6 < t^Vt < cos 9} 
~ \q*\\v\ ~ 



around the nullspace of q* . Here 7r/4 < 6 < ir/2 will subsequently be chosen to be 
large, depending on C' and C[. We shall establish the theorem by taking 9 as ir/2 
in case 1 and r > small in Case 2 below. 

Case 1 (Second order): Let v S B r (0) n Kg. Then 

max[/(0, x + v),f(l,x + v)] — f(t, x + v) 

> tf(l,x + v) + (1 - f)/(0, x + v)- f(t, x + v) 

> [tD 2 f{l,x) + (1 - t)D 2 f(0,x) - D 2 f(t,x)](v,v) - ||c|| C 3 W | W | 3 

where the first order term v(tp*(0) + (1 — t)p*(X) — p*(t)) of the Taylor expansion 
has vanished due to the linearity of p*(t) — p*(0) + tq* . To bound the second 
term using (B.6 1, rotate the Euclidean coordinates so that v = (vi,V), where 
V\ = q*V/\q*\ and (0,V) lies in the nullspace q* . Note that for v e B r (0) (1 Kg 
we have < |u| cos 9 and hence |w| 2 sin 2 9 < \V\ 2 . After the mean value theorem, 
fOll yields 



[tD 2 f(l, x) + (l- t)D 2 f(0, x) - D 2 f(t, x)](v, v) 
> \C Q t{l - t)\q*\ 2 \V\ 2 l -C[t{l t)\q*\ 2 {2\ Vl \\ 



> 



2^° 
1 

2 

1 

4 



+ \vi\ 2 ) 



C' sm 2 9-3C' 1 cos9 t(l - t)\q* 



> 7 C' t(l-t) 



x(l)-x(0) 



\[DDc]-i\\ c °(N) 

noting ( |B.5[ ), and provided we choose 7r/4 < 9 < tt/2 sufficiently large that 
(B.9) sin 2 6» > - + ^cosf 



This establishes the desired inequality for Case 1 for 9 close to tt/2. 
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Case 2 (First order): Let v £ B r (0) \ Kg, and recall 

p*(l)-p*(t) = (l-t)q* 
p*(0)-p*(t) = -tq*. 

Now either q*v l < — \q*\\v\ cos 9 or |g*||v| cos 9 < qtv" 1 . \{q*v l < 0, Taylor expansion 
yields 

max[/(0, x + v),f(l, x + v)] - f(t, x + v) 
>f(Q,x + v)-f(t,x + v) 

= (pj(0) -*>*(*) K + (D 2 f(0,x) - D 2 f(t,x))(v,v) + 0(\v\ 3 ) 
> tcos9\v\\q*\ - t\\D 2 Dc\\ c o (N) max {|i(r)|}|t>| 2 - \\c\\ CH n) \v\ 3 

0<r<i 

^tHI^^cose-Hcllos^ll^Dcl-^lco^lt;!) - ||c|| C 3 (Ar) M 3 
>t|w||x(l)-5(0)|(||[£> J D C ]- 1 ||^ (JV) co S e-||c|| C 3 W |«|) -|Mlc»(JV)M 3 
where (B.4 |-( |B~5| have been used. Similarly, if q*w l > 0, then 



max[/(0, x + v),f(l, x + v)]- f(t, x + v) 

> (l-tJHI^lJ-xCOjI^D^^y^COBd-llclloawH) - Hc||c3(iV)|w| 3 . 



If we assume 

l,m " 1 'lc°(JV) 



(B.10) \v\<r<r 



[DDc^Wcl^ cos( 



\ c \\c 3 (N) + Ci DiamM 
then for v € B r (0) \ Kg with either sign of q*v l we find 

max[/(0, x + v),f(l,x + v)] — f(t, x + v) 

> dt(l - t)\v\ 2 \x(l) - x(0)\ 2 - ||c|| c3(iV) |«| 3 . 

Combining Case 1 with Case 2 establishes the proposition; the constants have 



the desired dependence according to (B.7|, (B.8l, (B.9), (B.10), and (4.2 1. □ 



Appendix C. On fitting sausages into balls 

Loeper's Holder continuity argument for optimal mappings F : — ► Cl between 
domains M = C R™ and M = Cl C R™ is based on a localized volume compari- 
son. To set this up, for any optimal map which fails to be C 1 / 5 ^; cl Q), the next 
proposition identifies e, S > and a null geodesic t e [0, 1] — ► (x,x(t)) 6 x f2, 
such that an e-ball in coordinates around x and a <5-sausage Af$ around the middle 
third of x(t) satisfy F~ 1 (Afs) C B e (x). More precisely, we establish Proposition 



C.2 for the potential function u of the mapping F = c-Exp o Du. 

We start by defining (5-neighbourhoods and relative ^-neighbourhoods. As usual, 
[x,y] denotes the Euclidean convex hull of the two points x,y € R™, and we write 
Y CC O to indicate that the closure of Y is a compact subset of Cl. 
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Definition C.l. (Relative ^-neighbourhoods) Given Y C £1 C R", denote 
Afs(Y) — {x G R" | there exists y G Y such that \x — y\ < 6}, 
Mf(Y) = {x G (1 \th ere exists y G Y such that \x — y\ < S 
and the line segment [x, y] C Cl}. 

Note Nf{Y) = fl nAf s (Y) if Cl C R" is convex. The point of this section is the 
following version of Loeper's Proposition 5.3 [2rJj . 

Proposition C.2. (Fitting sausages into balls) Let c G C 4 (c\N) be strictly 
regular (A2)— (A3s) on the closure of the product N = f2 X Q of two bounded 
domains f2 and f2 C R". Fix u : £1 — > R semiconvex with constant C, such that 
any mountain focused in clfi and supporting u to first-order is dominated by u 
throughout fl. Choose a pair of points (xq,Xq) and (xi,Xi) from d c u C cl(f2 x Cl) 
with xq ^ x\ and [xq,Xi] C £1. From the definition of d c u — O^qU, both mountains 

(C.l) fi(-) = -c(-,Xi) + c(Xj,Xj) + u(xi), i = 0, 1, 

support u, hence there exists a point x where the valley of indifference /o(x) = 
/l(x) intersects the line segment [xo,Xi] C fl. Suppose a geodesic t G]0,1[ — > 
(x, x(t)) G N for the pseudo-metric (2.1) links xq = lim t ^o x(i) to x\ = lirnt_>i x(t). 
Then taking C ,C ,C > sufficiently small depending only on ro and C± from 



Proposition B.l the semiconvexity constant C, || c|| c 3 (JV) ^ and Dia.mil, implies: if 
|xi — xq\ < (C'C") 2 \xq — xi| 5 then for any e > in the non-empty interval 



(C2) ± J^4H< e <C"|x -x 1 | 2 , and 

u y |x - xi| 

(C3) 6 := C""e |x - xi| 2 , 

suc/i £/ia£ B e (x) CC we Ziawe [cl-B e (x) x {z}] n <9 c u non-empty for each z G 
A/?({x(t) | §<*<§}). 

Proof. We present a modified version of the original proof of Loeper |26] . 

In the course of the proof, S > will be determined. Given z G A/| 2 ({x(i), t G 
[1/3,2/3]}), define the mountain 

Si (•) := -c(-, z) + c(x, z) + u(x) 

focused at z which agrees with u(-) at x but need not be a supporting mountain. 
Our goal is to specify e,S > with B e (x) CC ft such that 

(C.4) u(-) > g-^(-) throughout dB e (x). 

Once this is established, it implies the existence of z which realizes the global mini- 
mum of u(-) — on cl-B e (x). Adjusting the height of /•?(•) by u{z) — g g (z) yields 
a mountain which supports it(-) to first-order at z, hence globally by hypothesis. 
This implies (z, z) G d c u as desired. 

By construction, u > max[/o,/i] on f2. The choice of z implies t G [1/3,2/3] 
exists with \z — x(t)\ < S and 

(C.5) the line segment [z,x(i)] C f2. 
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Assume fo(x) = = fx(x) without loss of generality, so 

ft(-) :=-c(-,x(t)) + c(x,x(t)), te[0,l], 



extends (C.ll. Let e < ro be chosen to satisfy B e (x) CC f2 later, where ro is from 
Proposition B.l Then for each y 6 dB t (x), 

u(y) - 9z{y) > max[/ (y), A(y)] - g z {y) 

(C.6) > f t (y) - gM + dt(l - t)\x - Xl \ 2 \y - x\ 2 - ||c(-, 0||c"(jv)ltf ~ ^l'- 

To achieve ( |C.4[ ) we want the right hand side nonnegative. First note 

ft(y) - 9z{y) = ~c{y, x{t)) + c(x, x(t)) + c(y, z) - c(x, z) - u(x) 
(C.7) > -|| C || c2(JV) |y - x\\z-x(t)\ - u(x) 

by second derivative estimates; this estimate is the only place where we use the 



condition ( C.5 1 in the proof. Recall x = (1 — s)xq + sx\ for some s £ [0, 1]. Semi- 



convexity implies 

u(x) < (1 — s)u(xq) + su(xi) + C\xq — xi\ 2 /2 

where 
(1 — s)u(xq) + su(xi) 
= (1 - s)fo(x ) + sfx(x{) 

< (1 - s)Df (x)(x -x) + sD/i(x)(xi -x) + — ||c||c 2 Cjv-) (l^o - x| 2 + |xi - x\ 2 ) 

< s(l - s)[Df (x) - Afi(z)]Oo - xi) + ||c|| c = W |x - xi| 2 /2 

< s(l - s)||£>£>c|| c o (JV) |xo - xi||x -xi\ + ||c|| C 2 W |xi - x Q \ 2 /2 
Therefore, 

u(x) < \\DDc\\ c o m \x - Xl \\x - Xl \ + (\\c\\ C 2(n) + C)\xq - a;i| 2 /2. 

From ( C.6[ ) and ( C.7| , the desired inequality (C.4| reduces to the following: 

< Cit(l - t)\x - xi| 2 e 2 - ||c|| C 2 (A r)e5 - ||c|| C 2 (J v)|xo - Xi\\x - x x \ 

- (ll c llc 2 (W) + C)\x x - x | 2 /2 - \\c\\ C 3 {N) e 3 

which can be satisfied if the negative terms are small relative to the positive term 
Cii(l - t)\x - xx\ 2 e 2 > | S - £i| 2 e 2 - This will be the case if 

<5 ^ 1*0 - Si | 2 e, 
- %i\ ^ l*o - ^i|e 2 , 

e < 1*0 - xi| 2 

where all the relevant constants (for <) depend only on C, C\, ||c||c 2 (a), an d 
II c ll C 3 (iv) ■ Fixing suitable constants C , C" and C" implies the choices ( C.2 1 and 
(C.3) suffice, provided C" is taken small enough depending on r and Diamfj 
to have e < r as required. For the interval (C.2 1 to be non-empty, we need 
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\xo — xi\ < (C'C") 2 \xq — x\\ 5 , which is enough to ensure compatibility of the 
preceding four inequalities. □ 

Appendix D. Holder continuity of optimal transport by volume 

COMPARISON 

Let us now show how the Holder continuity of optimal mappings — Loeper's The- 



orem 3.4 and Proposition 5.5 [26] — follows from Theorem A. 10 Proposition B.l 
Proposition |C.2| and Loeper's volume comparison argument, which we now recall. 
Note that Holder continuity of the optimal transport map F = c-Exp o Du implies 
Holder differentiability of the corresponding potential u £ <S~7 vv. 

Theorem D.l. (Holder continuity [26]) Assume c £ C 4 (c\N) is bi-twisted and 
strictly regular (Al)— (A3s) on clN, where N = il x c R" x R™ is a bounded 



domain bi-convex with respect to the pseudo-metric (2.1 ). Fix m > 0, and let p and 



p be probability measures on fl and with Lebesgue densities dp/dvol > m through- 
out fj and dp/dvol £ L°°(0). Then there exists a map F £ ^Vn««{Mn 1 }^Q jC iq^ 
between p and p which is optimal with respect to the transportation cost c. 

Proof. Note c £ C 4 (cl7V) since strict regularity extends to a neighbourhood of 
clN by hypothesis. By Kantorovich duality, there exists u £ <5T^ such that the 



optimal 7 in ( 1.1 ) vanishes outside d c u = d^ lQ u [31] [40]. Moreover, both u and its 
transform u := u^in are globally Lipschitz and semiconvex from Lemma 



our hypotheses on the cost. By Corollary A. 5 clf2 C Dom<9 c u. Since u(x) + u(x) 



A.4 



and 



c(x, x) > with equality on d c u 7 the bi-twist condition yields d c u n (Dom Du x 
DomDii) C Graph(i ;l )nAntigraph(G7) where F := c-ExpoDu and G := c*-ExpoDu 
[T8] [9] [27]. Here Antigraph(G) := {(G(x),x) \ x £ Domfli}, and Dom Dm x 
DomDw C £1 x Cl is a set of full mass for p ® p, a fortiori for 7. Moreover, 
x = G(F(x)) holds for p-a,.c. x £ il, and x — F(G(x)) for vol-a.e. x £ Ct. We shall 
establish the theorem following [26 . 

Since N is horizontally-convex and (Al') is satisfied, Theorem A. 10 ensures 
the function u satisfies the hypotheses of Proposition |C.2| namely that that any 
mountain supporting u to first-order supports u globally. Fix r\ > 0, and let fl^ := 
Cl\M n (dCl). Then xq,Xi £ with |xo — < rj/2 implies [xo,xi] C 0^/2; so any 
e < 77/2 implies Af e ([xo, x±]) CC ft. 

Pick distinct points xq,x± £ with (a;,, Xi) £ d c u(xi) n (DomDu x DomDu), 
i = 0,1, such that 

(D.l) \x - Xl \ < mm{(C'Cy\x - Xl \ 5 , C^f\x - x x \, |}. 

If there no such pair of points exists, then F £ C 1 / 5 (ri l; ; cl fl) with a Holder constant 
depending on 77, C", C", and the Euclidean path diameter of Choosing x as in 
Proposition |C.2[ the vertical-convexity of N yields a geodesic t £ [0, 1] — > (x, S(t)) G 
iV connecting (a;, So) to (x,xi). 

Unless (a;o,a;o) and (xi,Xi) can be chosen to make e = (|xq — ^i|/|3o — x^) 1 / 2 /C 1 



and hence 5 = C"'e\xQ — X\\ arbitrarily small in (C.2 )-(C.3 1, the map F is Lipschitz 
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on ft,,. Noting < e < rj/2, Proposition C.2 yields 

(D.2) Afp({x(t),t G [1/3,2/3]}) c d c u{B t {x)). 

Since the cost c is bi-twisted, the map G :— c*-Exp o Du is well-defined on the 
set Dom Du C ft of differentiability for u := u° in . This set has full Lebesgue 
measure (its complement has Hausdorff dimension n — 1), and if (x,x) G d c u with 
x G Dom Du then x = G(x). Now ( |D.2[ ) asserts 

(D.3) 5 e (» 3 ({a(t),i G [1/3,2/3]}) n Dom Du \ nDomUit 

= F~ 1 (Nf ({x(t),t G [1/3,2/3]}) n Dom .Du) n Dom Dm. 

Since P#/0 = p has density dp/dvol > m, integrating p over these two sets yields 

(D.4) ||p|U~(Q) volB e (x) > mvol(J\fp({x(t),t G [1/3,2/3]})). 

For c G C 4 (N) and twisted, c-Exp : Dc(x,Cl) — ► ft gives C 3 diffcomorphism. 
The pre-image of {x(^)}te[i/3 .2/3] under this map is a segment in Dc{x, ft) C T*ft 
according to Lemma |4.4| Noting that bi-convexity of N implies convexity of 
Dc(x, ft), it is not hard to see 

(D.5) vol(M?i{x(t),te [1/3,2/3]})) >vol(M s ({x(t),t e [1/3,2/3]})) 

for 5 > sufficiently small depending on c and ./V but not on xq, x\ or x. Recalling 
(C.3 ) then yields 

_n 11 _ 1 

L°=(0) 



(D.6) ^IMlL-m^m^-^o-Sil 



_ frtlll \n— li- — |2n — 1 

= m(C e) |xo — a;i | 

Thus whenever e = (|xo — Xi|/|5o — Si|) 1/,2 /C" and hence S is sufficiently small we 
find 

IMIi~(Q)|zo - zi| > (mC') 2 (C"') 2 "- 2 |xo - xxl 4 - 1 

so F G C 1 /' 4 ™ -1 ) U C°' 1 (ri, ; , cl ft). Since 77 > was arbitrary, the proof is complete. 

□ 

Remark D.2 (Rougher measures). Note that no smoothness is required of the mea- 
sures in the preceding theorem: p need not even be absolutely continuous with 
respect to Lebesgue and the support of p need not be connected. Replacing the 
hypothesis dp/dvol G L°°(ft) by the existence of p G]n,+oo] and C(p) > such 
that 

(D.7) p(B e {x)) < C(p)e n ~ n / p for all e > 0, x G ft, 

Loeper [25] was able to conclude F G C\£ U C { loc p )/( ™ 1 p ' (ft, cl ft) by assuming 
n > 2 and modifying the passage from (D.3 1 to (D.4 1 in the obvious way in the 
argument above. Replacing ( D.7[ ) by the even weaker condition that there exists 
/ : R+ — > R+ with lim £ ^o /(e) — such that 

(D.8) p(B e (x)) < /(e)e"- 1 for all e > 0, x G ft, 
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Loeper was able to conclude continuity of F : CI — ► cl CI similarly. In these cases p 
need not be absolutely continuous with respect to Lebesgue measure, though (D 



precludes it from charging rectifiable sets of dimension less than or equal to n — 1 . 

Remark D.3 (Holder continuity of the boundary map). Strictly regular and bi- 
twisted on cliV means N — CI x CI is compactly contained in a larger domain 
N' C R™ x R™ on which c £ C A (N') remains strictly regular and bi-twisted (AO)— 
(A3s). If we allow N' <£_ R" x R n to be a manifold which contains cl(iV) compactly 
we see the boundedness hypotheses on M = CI and M = CI become unnecessary 



in Propositions B.l C.2 and Theorem D.l If a bi-convex product domain N' — 



CI' X Cl CC N exists with Cl CC Cl' , then for n > 2 Theorem D.l asserts F £ 
C I 1 < 4 ( * n ~ 1) (n , ,cin) ) hence F £ C^^-^^clCl). 

Appendix E. Continuity of optimal transport on the sphere 

This section presents a continuity result of optimal transport on the sphere as 
an a priori step toward Holder continuity. It is known from the work of Cordero- 
Erausquin, McCann & Schmuckcnschlager |13j that on a compact n-dimensional 
Riemannian manifold (M = M, g) with the Riemannian distance squared cost 



c = cr/2 of Example 3.6 the optimal map stays away from cut locus for p-almost 
all points, when the measures p and p do not charge rectifiable sets of Hausdorff 
dimension n — 1. On the round sphere, Delanoe & Loeper provided a quantita- 
tive estimate of the distance separating F{x) from the cut locus of x in terms of 



|| log dp/d vol ||l°°(s™) and || logdp/dvol ||l°°(S")- In Corollary E.2 we use the pre- 
ceding ideas to recover a form of this statement which, though less sharp than the 
result of [T7], still suffices for our purpose: namely, to give a self-contained proof 



of Loeper's Holder continuity result on the sphere, Theorem E.3 It is not known 
whether optimal maps with respect to strictly regular costs c = d 2 /2 on other 
Riemannian manifolds stay uniformly away from the cut locus, however. 

Theorem E.l. (Continuity of maps on the sphere) Let M = <9Bi(0) C R' i+1 
be the Euclidean unit sphere, also denoted S", equipped with either cost: 

(a) c(x,x) = dist 2 (ir, x)/2, where dist denotes the great circle distance in S n , 

(b) c(x,x) — — log |x — x\ where \ ■ \ denotes the Euclidean distance o/R ra+1 . 



Fix probability measures p,p on the round sphere S n , one satisfying (D.8) and the 
other having a density dp/dvol > m > with respect to Riemannian volume on 
S™. Then some continuous map F on S™ between p and p is optimal with respect 
to the cost function c and takes the form F = c-Exp o Du with u G C7 1 (S") n . 

Proof. Let N C S™ x S n denote the set of points where c is smooth, and A = 
S" x S n \ N its complement, the singular set: 

• for cost (a), A = {(ar, —x) | —x is the antipode of x £ S™} 

• for cost (b), A = {(x,x) | x £ S™}. 



Note the pseudo-Riemannian metric h of (2.1 1 is strictly regular on N, and N is bi 



convex with respect to h. Moreover, this bi-convexity is 2-uniform (a fortiori strict) 
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in the sense that Dc(x, N(x)) C T*S n is either (a) a disc of radius n = Diam S n or 
(b) the entire cotangent space. 

The form F — c-Exp o Du of the optimal map and Lipschitz potential u E 6>g„ c 
can be found in (a) McCann |29j . where u is semiconvex according to Cordero- 
Erausquin, McCann & Schmuckenschlager |13j . The same form can be deduced 
for cost (b) analogously, using the Lipschitz and semiconvexity from Proposition 
|A.6| To derive a contradiction, suppose u is not diffcrentiable at some point 
x G S". Since u is globally Lipschitz and semiconvex this means the compact 
convex set du(x) C T*S" consists of more than one point. Choose two distinct 
co- vectors p* q* extremal in du(x). Extremality yields sequences Xk — > x 
and yk — > x in DomDw such that Du{xk) —* P* and Du{yk) — > q* by §25.6 



[35]. Lemma A. 9 asserts (xk,Du{xk)) € clDomc-Exp and provides a support- 
ing mountain — c(-,Xk) + c(xk,Xk) + u(xk) for u at Xk which is focused at Xk = 
c-Hxp' Xk Du(xk) ■ The limit k — > oo yields a mountain supporting u at x and fo- 
cused at x = c-Exp^p* ; for the same reason, another mountain supporting u at x is 
focused at y = c-Exp^.q*. Whether or not these mountains are distinct, a geodesic 
f' €]0, 1[ — > (a;,z(<')) in {x} x iV(a;) which extends from (x,x) to (x,y) is defined 
by z(t') := c-Exp a .[(l — t')p*+t'q*\ according to Lemma 4.4 Applying Theorem 3.1 



to (a) Example |3J| and (b) Example [375| yields z{t') £ d c u(x) for all e [0, 1]. 

To exploit Proposition |C.2[ first localize the domains to subdomains of R™ in the 
following way. From the strict vertical convexity of S™ x S"\ A, there is a subdomain 
Cl CC N(x) containing {^(^')}t'e[i/4,3/41- Reparameterize this portion of the curve 
to get a geodesic t € [0, 1] — > (x, x(i)) with distinct endpoints x(0) — z(l/A) and 
5(1) = z(3/4). Choose a coordinate neighborhood c S" of x which corresponds 
to a bounded domain CC R n ; Q can be mapped to bounded domain in R" by 
using its diffeomorphic preimage under c-Exp^,. 

The rest of proof is an easy application of Proposition |C.2| to this case xq = x = 
X\ and xq = x(0) ^ X\ = x(X); we employ the notation of that proposition. Note 



that u satisfies the hypothesis of Proposition |C.2| according to Theorem A. 10 



We apply Proposition C.2 to Graph F C d c u. First notice from ( C.2 1 that e may 
be chosen arbitrarily in the range ]0, C"\xo — Xi\ 2 ]. To compare the volume of the 
ball B e (x) to the sausage J^f({x(t),t € [1/3,2/3]}), we repeat steps (|L\3|-(|L\6| 
from the proof of Theorem |D . 1 1 to obtain 



m(C" , e)™- 1 |x -ii| 5 



= mS n \x - xi\ 

< P[Be(x)} 

= o(e"- 1 ) ase^O, 



where the last equality is given by (D.8l. The limit e — > forces xo — x\ 



contradiction! We therefore conclude u is differentiable, which implies u G C 1 (S") 



by the closedness of du noted after (A.4|. The continuity of F — c-ExpoDu follows 
immediately. □ 

Corollary E.2. (Uniformly Away from Singular set [T7]) Assume the hy- 
potheses of Theorem \E.1\ namely (M — S™,dist,vol) is the round sphere and 
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F : S n ► S™ is an optimal map between a probability measure p satisfying (D.8) 

and one satisfying dp/dvol > m > for the (a) Riemannian distance squared or 
(b) negative logarithm of the Euclidean cost. Let JV C S" x S™ be the set where c 
is smooth. There exists a constant S = S(p,p) > such that p-a.e. point x £ S" 
satisfies 



(E.l) 



dist(F(a;),S n \ JV(x)) > S. 



Proof. Since the map F is unique (up to p- negligible sets) |29j . Theorem E.l asserts 

' 5 — 

it costs no generality to take F — c-Exp o Du continuous on S n and u c c = u £ 
C^S"). Recall for cost (a) N(x) = {x £ S n | &ist{x,x) < tt = Diam S n S n }, 
while for cost (b) N(x) = {i £ S™ | i / x}. In case (b) the desired estimate 
follows directly from the Lipschitz bound on Du provided by Proposition A. 6 since 
|Z?c(x,i)| — > oo as x — > x. If the corollary fails in case (a), there is a sequence 
of points Xk £ S" with dist(-F(iEft), S™ \ N(xk)) — > as k — > oo. Extracting a 
subsequential limit yields it -i i £ S" with F(x) £ N(x) — Ran c-Exp^. antipodal 
to x. Due to the symmetry of the sphere, the convex set du(x) cannot then consist 
of a single point of non-zero magnitude; instead du(x) — {p* £ T*S n | \p*\ < tt} 
contradicting the claim u £ C 1 (S"). □ 

Finally, we collect the ingredients above to conclude our direct proof of (a) 
Loeper's result and (b) his analogous improvements of Caffarelli, Gutierrez & 
Huang's result for the reflector antenna [H]. 

Theorem E.3. (Holder continuity of optimal maps on the sphere [26], [8]) 
As in Theorem\E.l\ assume p and p are probability measures on the round sphere 



(M = S™, dist, vol) satisfying (D.7) for some p £]n, oo] and dp{x)/d vol > m > for 
all x £ S". For either cost (a) c(x, x) = dist 2 (a;, x)/2 or (b) c(x, x) = — log \x — x\ 



the optimal map F between p and p lies in C 1 / 5 U C^ 1 p ' > ^ 4r ' 



Proof. We first set up a domain N to which we shall apply the proof of Theorem |D.l| 
Let gtj denote the standard metric tensor on the round sphere S™. For each z £ S n , 
let — z denote its antipodal point. Observe that each g-geodesic ball Br(z) appears 
convex from z £ S™ for the Riemannian distance squared cost (a), and from — z for 



the negative logarithmic cost (b); see Definition 2.5 In both cases, the convexity 



is 2-uniform in the sense that Dc(z, Br(±z)) is a 2-uniform subset of T z S n . 



Let z be an arbitrary point in S n and S = S(p, p) as in (E.l ). Fix 7r — 8 < R < n. 
By compactness, we can choose 0<r<S + R — n sufficiently small that the set 
N = B r (z) x Bh(±z) remains vertically-convex, choosing the plus sign for cost 
(a) and the minus sign for cost (b). If we use g-geodesic polar coordinates we can 
regard the geodesic balls as Euclidean balls in the tangent spaces T z S n and T± z S n . 

We now apply the volume comparison argument of the proof of Theorem |D.1| to 
the domain N. A few points should be clarified. Horizontal convexity of JV was used 



in Theorem |D . 1 1 only to be able to apply Theorem A. 10 But in the present context, 
we need not apply Theorem A.10| since we have its conclusion for u on the whole S n 
directly by combining Theorem E.l with Lemma A. 9 Also F satisfies {(x, F(x)) \ 



£ B r (z)} C JV by Corollary E.2 The remainder of the proof of Theorem D.l goes 
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through in the present case, yielding F G C^ 5 U c£ c ^ )/(4 ™ 1 »\B r (z), S n ). This 
local estimate becomes global by compactness of S", so the proof is complete. □ 
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